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This paper reviews the fabrication technology developed for 
Josephson integrated circuits and describes the techniques used in 
the standard process for various circuit elements including Nb/ 
AlO,./Nb junctions. These process techniques were verified based 
on the characteristics of the 8K-bit cell array chips fabricated for 
Process evaluation as being applicable to Josephson circuits that 
include several thousand junctions. Without the process technolo- 
gy based on the reliable Nb/AIO,/Nb junctions, the recently 
demonstrated ultrahigh-speed operation of Josephson integrated 
circuits would not have been possible. This paper also introduces 
advanced process techniques developed for future Josephson LSI 
application. 
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Josephson junctions with Nb/AIO,./Nb structures have made it 
possible to fabricate a variety of high-speed circuits, including an 
8-bit digital signal processor and 4K-bit memory . 

These circuits operate at ultrafast speeds and consume less power 
than any high-speed semiconductor circuits. An interface circuit 
which issues the signal from Josephson circuits to semiconductor 
circuits has also been fabricated. In short, it is possible to fabricate 
all components necessary for constructing a Josephson computer. 
Now the high-speed operation of the Josephson computer must be 
demonstrated. 
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Superconducting Quantum Interference Devices (SQUIDs) can 
measure magnetic fields as low as femto tesla and have been used 
for biomagnetism, resource surveying, and physical or geophysical 
measurement. Unlike the conventional SQUID, the single-chip 
SQUID is the first device that intergrates both a SQUID sensor and 
a feedback circuit on the same chip. This reduces the number of 
external cables. Also, the output can be processed with Josephson 
digital circuits in the cryogenic environment. These advantages 
open up the possibility of constructing a multichannel system with 
an array of more than 100 SQUIDs, which is required to measure a 
magnetic field map in detail. 
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Monolithic GalnAs twin PIN photodiodes were fabricated to 
realize excellent dual-detector balanced optical receivers in coher- 
ent optical communication systems. Introducing a backillumi- 
nated, flip-chip structure gives these photodiodes a small junction 
capacitance of 80 fF, a quantum efficiency of 75 percent at a 
wavelength of 1.54 um, and a cutoff frequency better than 
15 GHz. An optical input power level as high as 8 mW is obtained. 

A large fiber alignment tolerance of 60 um is achieved by inte- 
grating InP microlenses. The common-mode rejection ratio is 
better than —30 dB at a frequency of up to 10 GHz. The perfor- 
mance demonstrated is well suited to highspeed optical coherent 
communication systems. 
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In 1988, the Phase | standard of Synchronous Optical Network 
(SONET) was established by ANSI. Standards committees are 
planning for Phase II, which defines the functions for maintenance 
and operations, and which will be generally standardized in late 
1990. 

Fujitsu produced the Fiber Loop Multiplexer (FLM) series con- 
forming to the Phase | standard for the Regional Bell Operating 
Companies (RBOCs) before its competitors. The FLM was well 
received. 

This paper describes the SONET systems created by Fujitsu, 
Fujitsu’s plan to upgrade to Phase Il, and the key technologies 
used in the product. 
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Charging of a dielectric surface without corona discharge has 
been achieved using a conductive fur brush to which an electrical 
potential of several hundred volts is applied. In this method, it was 
observed that brush charging is accomplished by direct charge 
transfer, gas discharge, and triboelectric charging. Direct charge 
transfer, the primary charging process, was analyzed using an 
ohmic contact model. The charged potential is nearly proportional 
to that of the brush. When the charging time exceeds 0.3 seconds, 
the charged potential on the surface of the photoreceptor satu- 
rates at almost the potential of the brush. Brush charging is, 
therefore, useful as a low-voltage charging technique. 
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This paper investigate the use of organic materials for nonlinear 
optical devices as a means to providing a breakthrough in optical 
nonlinearities. For electro-optic devices, an improvement in sec- 
ond-order nonlinearity 10-100 times as large as that of LiNbO, 
is needed. 

For all-optical devices, an improvement in the third-order non- 
linearity over 10 times as large as that of ordinary polydiacetylene 
(PDA) is required. To achieve these objectives, several organic 
materials were designed based on newly proposed guidelines 
requiring that the balance between wave function overlap and 
separation (or difference) be optimized. Computer simulations 
show that the objectives may be attainable by controlling the wave 
function and forming quantum wells through adjusting donor and 
acceptor substitution sites in one-dimensional conjugated systems. 


UDC 621.3.049.75.002.2:681.32 
FUJITSU Sci. Tech. J., 27, 1, pp. 132-140 (1991) 


CIM System for PC Board Design and Manufacture 
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The importance of computer-integrated manufacture (CIM) in 
the design and manufacturing of printed circuit boards (PC 
boards) is widely acknowledged. It is difficult to increase the 
efficiency of schematic design, pattern layout design, or manufac- 
turing using CAD/CAM separately. This paper discusses the 
construction of a PC board CIM system for use as a key system in 
a company. It describes the background and scope of CIM, and 
explains the concept of CIM construction, showing examples. It 
also introduces the automated drawing input system for PC board 
design and the supporting system for mounting the parts. This sys- 
tem is advanced technology related to CIM construction. 
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This paper describes the high-speed processing achieved by the 
development of the structural analysis program POPLAS®/FEM5 
(called FEM5) on the multiple-pipeline architecture of the 
FUJITSU VP-series supercomputer. 

The CPU time has been reduced by using a method of solving 
simultaneous linear equations that takes advantage of performance 
capabilities of a supercomputer. This method is combined with a 
matrix column-row exchange method. Concentrated vector tuning 
is also used. 

The I/O time has been reduced by developing original |/O proc- 
essing. As a result, FEM5 has achieved superlative high-level, high- 
speed processing in the structural analysis field. 
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Josephson Integrated Circuits | 
Fabrication Technology 


@ Takeshi Imamura (Manuscript received September 3, 1990) 


This paper reviews the fabrication technology developed for Josephson integrated circuits 
and describes the techniques used in the standard process for various circuit elements in- 
cluding Nb/AIO,./Nb junctions. These process techniques were verified based on the charac- 
teristics of the 8K-bit cell array chips fabricated for process evaluation as being applicable to 
Josephson circuits that include several thousand junctions. Without the process technology 
based on the reliable Nb/AIO,./Nb junctions, the recently demonstrated ultrahigh-speed 
operation of Josephson integrated circuits would not have been possible. This paper also 
introduces advanced process techniques developed for future Josephson LSI application. 


1. Introduction 

Superconductivity has been studied ever 
since 1911). Although current flow without 
power consumption in electronics applications is 
certainly appealing, it took another fifty years 
before the application of superconductivity to 
electronics finally began. This is because the 
microscopic theory of superconductivity has 
unknown for a long time. Without a theoretical 
basis, it was difficult to develop superconduct- 
ing devices. Such difficulties were solved by 
two theoretical innovations introduced around 
1960. One is the quantum theory of supercon- 
ductivity proposed by Bardeen, Cooper, and 
Schrieffer in 1957”). The other is the supercon- 
ducting tunneling phenomenon theoretically 
predicted by Josephson in 1962%). This effect, 
called Josephson effect, was experimentally 
verified in 1963*). Based on such development 
in theoretical study, the application of super- 
conductivity to electronics has begun in ear- 
nest®)"”). The various types of proposed super- 
conducting devices are divided into three 
groups: the cryotron®?, Josephson junction 
devices”, and superconducting three-terminal 
devices'?’, Among them, only the Josephson 
junction features high-speed performance, which 
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has been experimentally demonstrated. Much 
effort has been focused on Josephson circuit 
technology. Even now, the application of 
Josephson junctions is limited to such applica- 
tions as voltage standard'!), magnetic sensor for 
medical use'?)”!*), and a mixer for radio astron- 
omy!5)-16) Discrete Josephson devices have 
been used in these applications. The integrated 
circuits for digital application remain in the 
research stage for two reasons: the ultrahigh 
performance not possible with semiconduc- 
tors has not been readily demonstrated using 
Josephson LSI chips, and even a small supercon- 
ducting digital computing system requires the 
further development of such peripheral technol- 
ogies as packaging and cooling. Recently, 
however, significant progress has been made in 
Josephson circuit technology'”)"!®). We expect 
to introduce a_ small-scale superconducting 
computing system in the near future. 

The Josephson junction devices use the 
superconducting tunneling effect, and, theoreti- 
cally, should achieve high-speed, low-power 
operation not possible with semiconduc- 
tors?°)?")_ Since the mid-1960s, IBM has spear- 
headed Josephson device research®)*”). During 
the mid-1970s to early 1980s, many organiza- 


T. Imamura: Josephson Integrated Circuits I: Fabrication Technology 


tions also joined the race to develop a Josephson 
computer??)?”). This work stopped abruptly 
when IBM ended its Josephson computer project 
in September 19837°) for two reasons. First, 
Josephson circuits were not much faster than 
semiconductor circuits. Secondly, it was diffi- 
cult to develop a high-speed cache memory. Ever 
since IBM lost its enthusiasm for developing 
Josephson junctions for digital applications, 
others have followed in kind. Research in Japan, 
however, has continued all along. One of the 
problems posed by Josephson junctions used at 
that time was the instability of the junction 
materials — lead alloys”®)”>"”. For one thing, the 
critical current of the junction increased with 
time during storage at room temperature. For 
another, the scattering of critical current was 
too large, even for junctions on the same chip, 
making it very difficult to manufacture LSI 
circuits. 

The introduction of reliable niobium (Nb/ 
AlO,./Nb) junctions at the end of 1983 to re- 
place the obsolete lead-alloy junctions changed 
the picture dramatically. The Nb/AIO,/Nb junc- 
tion as developed by Gurvitch et al??). was 
further improved by Morohashi et al?*”34). The 
introduction of niobium junctions solved most 
of the problems hindering process reliability. 
Many kinds of fabricated logic and memory 
circuits could operate at much higher speeds 
than semiconductor circuits. Recently, a 
Josephson gate established a 1.5-ps gate delay 
at a power dissipation of only 12 uw>>). The 
world’s first Josephson microprocessor operated 
at a clock frequency of 770 MHz®®. An 8-bit 
digital signal processor with 6 300 gates operated 
at 1 GHz*”. Josephson digital circuit technol- 
ogy was also applied to the superconducting 
quantum interference device (SQUID) magne- 
tometer!?)-14), where the Josephson junction 
had been used as an analog magnetic sensor. 
This enabled us to develop a single-chip SQUID 
including a feedback circuit?®). These develop- 
ments have been made possible by the high 
quality and excellent uniformity of junction 
characteristics. 

This paper (Part I) reviews recent develop- 
ments in Josephson circuit technology achieved 


by our laboratories, and describes the fabrica- 
tion technology of Josephson circuits based on 
Nb/AIO,./Nb junctions. Part II’? describes the 
high-speed performance of fabricated Josephson 
integrated circuits, and Part III°®) describes the 
breakthrough made in SQUID magnetometers. 
Chapter 2 of this paper (Part I) reviews the 
Josephson junction materials investigated thus 
far. Chapter 3 describes the standard fabrication 
process using Nb/AlO, /Nb junctions. Chapter 4 
describes the advanced process technology being 
developed for future Josephson LSI application. 
In conclusion, chapter 5 summarizes this paper. 


2. Junction materials 

The Josephson junction is made of a thin 
tunneling barrier sandwiched between two su- 
perconducting films called the base and counter 
electrodes. See references 40 and 41 for the 
principles of Josephson junction. Figure la) 
shows a cross section of the junction, and 
1b) shows the current-voltage (I - V) characteris- 
tic. The maximum superconducting tunneling 
current is called critical current (/,). When the 
bias current exceeds /J,, the junction switches 
from zero-voltage state A to voltage state B. 
Thus, J, is one of the most important device 
parameters used in circuit design. In most 
Josephson junctions, the tunneling barrier is as 
thin as a few nanometers. Since J, decreases 
exponentially with the barrier thickness, barrier 
formation integrity is essential for obtaining 
good junctions. 

Circuit fabrication requires the use of stable, 
uniform, high-quality, and reproducible junc- 
tions. 


Superconductor 1 


Tunneling barrier 


Current (/) 
> 


Superconductor 2 


0 
Ioap voltage 
Voltage (V) 
b) I-V characteristics 


a) Structure of 
Josephson junction 
Fig. 1—Principles of the Josephson junction. 
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1) Stable 

The junction characteristic is stable against 
thermal cycling between 4.2 K and room tem- 
perature. Stability during long-term storage at 
room temperature is also crucial. 
2) Uniform 

I, variation is small (typically less than ten 
percent) for junctions ona chip. 
3) High-quality 

Small leakage current in the sub-gap voltage 
region for the I-V characteristic. Parameter V, 
is normally used to determine junction quality, 
as described later. 
4) Reproducible 

Expected junction characteristics, particu- 
larly the crirical current, are reproducible 


through the process. 
Throughout the 1970s, lead alloy materials 


have been used as junction electrode materials. 
Native oxide grown by plasma oxidation was 
used for the junction barrier??). Much effort has 
been focused on improving the reliability of 
lead-alloy junctions*?)"*5), By the early 1980s, 
however, it became clear that unreliable lead- 
alloy junctions could not provide the Josephson 
integrated circuits required?) For example, Jc 
of the junction continues changing during long- 
term storage even at room temperature, and 
excessive /, variation hinders integrated circuit 
operation. Moreover, the initial junction yield 
is not high enough to facilitate use in circuits. 
Typically, one percent of the junctions is initial- 
ly shorted. Such unreliable characteristics are 
due to large-grained, corrosive, and mechanically 
soft lead-alloy electrodes. 

Much work has gone into replacing lead- 
alloy electrodes with such refractory materials as 
Nb and NbN*®), which are much more fine- 
grained, hard, and stable. Table 1 lists the ‘‘all- 
refractory” junctions reported thus far. ‘‘All-re- 
fractory’”’ means that both the base and counter 
electrodes are made of refractory materials. The 
sequence indicated by slashes (/) refers to 
“counter electrode/barrier/base electrode’. V, is 
the gap voltage, and V,, is the quality parameter 
defined by the product of the critical current 
and subgap resistance at 2mV. Recently, V,, 
for NbN junctions has been defined at 3 mV be- 
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Table 1. All-refractory Josephson Junctions 


Junction Vg(mV)| Vp Note )(mvV) Ref. 
Nb/oxide/Nb | (2.8 6-8 47, 51 
Nb/Au/oxide/Nb 2.6 26 54 
Nb/a-Si/Nb 2.8 28 55-58 
Nb/AIO,/Nb 2.8-2.9 40-70 32), 33 
Nb/ZrO,./Nb 2.8 50 59 
Nb/YbO,./Nb 2.8 15 60 
Nb/TaO,./Nb 2.8 25 61 
Nb/Si,,N,/Nb 2.8 16 62 
NbN/oxide/NbN_ | 4.0-4.4 15-25 63, 64 
NbN/a-Si/NbN 4.4 10-12 65, 66 
Nb/p-Ge/NbN 4.0 35-48 68 
NbN/MgO/NbN_ | 5.1-5.4 45-5QNote2)) 79.73 


Note 1): Vj, defined at 2 mV. 
Note 2): V,, defined at 3 mV. 


cause the Nb junction has larger Vz than Nb 
junctions. Junctions to be used in Josephson 
integrated circuits require V, exceeding 
~30 mV to afford a wide circuit’s operating 
margin enough for practical use. 

In early studies, Nb was used for the base 
electrode alone, and the Nb, O, barrier enjoyed 
some success*”)"5%, Attempts to use all-refrac- 
tory junctions, however, were less than success- 
ful. High-quality junctions feasible for circuit 
applications were difficult to fabricate, and V,, 
was less than 10 mV*’"5"). The subgap con- 
ductance was too large for use in circuits. This 
means that the tunneling barrier contains 
pinholes, inducing leakage current, apparently 
due to the chemical reaction between the Nb 
counter electrode and Nb, O, barrier. Studies on 
the interface between the Nb electrode and 
oxide barrier®?”**) resulted in a thin Au layer 
being introduced to minimize subgap conduct- 
ance”), The highest attainable Vm (26 mV) still 
proved insufficient for use in Josephson LSI. 

Studies on artificial barrier materials other 
than native Nb,O; have been challenged. *‘Arti- 
ficial” refers to the intentional depositing of 
barrier materials on the base electrode. Studies 
on barrier materials that react less at the inter- 
face with Nb electrodes have tended to take one 
of two approaches: Using a semiconductor 
barrier or using a metal oxide barrier. In the first 
approach, an amorphous silicon barrier offered 
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the best characteristics V, = 28 mv)**®, 
With the second approach, some of the many 
metals studied were found to function as 
good junction barriers when adequately oxi- 
dized>”) »59) -62) | Among them, the AIO, barrier 
has outstanding quality>” >), Vm reproducibly 
exceeds 40 mV while J, uniformity and con- 
trollability are far superior to those of any other 
refractory junctions. Not surprisingly, recent 
developments in Josephson circuits have pro- 
ceeded almost exclusively using Nb/AIO, /Nb 
junctions. 

A possible Nb alternative also receiving 
much attention has been NbN*®. The use of 
NbN, which has a higher critical temperature, 
results in large Vz and smaller subgap conduct- 
ance. NbN is also less chemically reactive than 
Nb and less sensitive to oxygen contaminants — 
both advantages in making high-quality junc- 
tions — but the coherence length of NbN is 
shorter than Nb, necessitating the formation of a 
thinner tunneling barrier on NbN. The greater 
penetration depth in NbN also reduces the junc- 
tion’s switching sensitivity. Of the many barrier 
materials used for junctions with NbN elec- 
trodes®?)"74) , MgO has shown the highest quali- 
ty —Vm of 45mV to 50mvV797"), Junction 
quality generally depends on the critical current 
density, particularly in NbN/MgO/NbN junc- 
tions. Thus, the V,, in Table | is selected from 
junctions with a practical current density greater 
than 1 000 A/cm?. 

The all-refractory junctions used to fabricate 
a variety of logic and memory circuits thus far 
are Nb/AIO,/Nb'7»'®), NbN/oxide/NbN”®?, 
and NbN/MgO/NbN’°®. Most Josephson circuits, 
however, have been made of Nb/AIO,/Nb. We 
have made significant progress in Josephson 
circuit technology development based on Nb/ 
AlO,./Nb junctions, mainly due to their high 
quality and reproducibility. 

The following chapters also introduce fabri- 
cation using the Nb/AlO,/Nb junctions devel- 
oped in our laboratories. After a brief descrip- 
tion of the standard integrated circuit 
fabrication process, this paper discusses the 
advanced techniques developed for future 
Josephson LSI application. 


Nb counter electrode 


AlO, tunneling barrier 


n SU 


Nb base electrode 


Nb ground plane 


Si substrate 


Fig. 2—Integrated circuits with Nb/ AlO, /Nb Josephson 
junctions. 


3. Standard process 
3.1 Outline 

Josephson integrated circuits consist of the 
Nb/AIO,./Nb Josephson junctions, SiO, insula- 
tors, Mo resistors, Nb ground plane, and Nb 
wiring, usually formed in nine to eleven layers 
on Si substrates’”). Each layer is patterned 
through a photoresist mask by reactive ion 
etching (RIE). 

As shown in the typical cross section in 
Fig. 2, Josephson circuits are usually fabricated 
above a superconducting ground plane. The Nb 
wiring formed above the ground plane composes 
a superconducting strip line. This is necessary to 
enable high-speed signal transmission between 
adjacent junctions. Resistors that are used as 
loads or as damping resistors’®) in junction 
switching contact the base electrode. Contact 
holes are formed in insulators above the junction 
counter electrodes to ensure electrical contact 
with subsequent wiring. The contacts between 
the base electrode and ground plane, which are 
not shown in Fig. 2, are necessary for Josephson 
circuits. Contact is enabled by the base electrode 
through a hole in the insulation layer above the 
ground plane. In logic circuits designed using the 
MVTL gate family’?), wiring is set in the top 
layer. In memory and some types of logic cir- 
cuits, two additional layers are formed on the 
wiring, i.e. insulators and control lines. 

Table 2 lists the layer materials and thick- 
nesses. Nb, Al, Mo and SiO, are deposited by 
sputtering under the conditions listed in Table 3. 
The SiO, used as a protective (P) layer for Mo 
resistors during RIE of the Nb base electrode is 
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Table 2. Circuit layers 


Layer Material |Thickness (nm) 
GP Ground plane Nb 300 
I, Insulation SiO, 300 
R_ Resistor Mo 100 
P_ Resistor protection | SiO or SiO, 100 
B_ Base electrode Nb 200 
— Tunneling barrier AlO,,-Al ri, 
C Counter electrode Nb 100-200 
I, Insulation SiO, 400 
WR Wiring Nb 600 
I; Insulation SiO, 800 
CL Control line Nb | 1000 


Table 3. Metal and insulation layer sputtering 


Material Ar ie a 
Nb 13-23 200 
Al 13 8 
Mo 0.67 130 
SiO, 1.3 8 


Table 4. Metal and insulation layer patterning 


Power 
Material Reactive gas Pressure density 
(Pa) (W/cm?) 
Nb CF, (5-20 % O27) 2.7-6.7 0.10 
Al Ar 0.7 0.15 
Mo CF, (5 % O2) 6.7 0.10 
SiO, CHF; (0-15% O2) 2.0 0.20 


deposited by evaporating SiO in an oxygen 
atmosphere®®?. All layers except the P layer are 
patterned by RIE. Table 4 lists the reactive gases 
used in RIE. Because Al and AIO, are not 
etched by reactive CF, and CHF3, they act as 
an etching stopper. The thin AlO,-AlI barrier is 
removed by Ar sputter etching. The SiO, used 
for the P layer is patterned by lift afie*. 
Although the key process in Josephson LSI 
development is producing high-quality junctions, 
the other circuit elements are also important. 
These elements include the insulation layer, 
superconducting wiring resistors, and contacts 
between two Nb layers. The following sections 
briefly describe the standard fabrication process 
optimized for Nb/AIO, /Nb junctions, SiO, insu- 
lators, Nb wirings, contacts, and Mo resistor. 
The final section evaluates the standard process 


FUJITSU Sci. Tech. J., 27, 1, (April 1991) 


Fig. 3—Nb/ AlO,./Nb Josephson junction. 


in full vertical structures from the ground plane 
to the control line, both at 300 K and 4.2 K. 


3.2 Nb/AIO,. /Nb junctions 

3.2.1 Excellence of Nb/AIO,./Nb junctions 

Figure 3 shows a cross section of the Nb/ 
AlO,./Nb junction used in the standard process. 
The required junction characteristics are stabili- 
ty, uniformity, reproducibility, and high quality 
as described in Chap. 2. On these characteristics, 
the Nb/AIO,/Nb junctions offer the best I-V 
characteristics among all refractory junctions. 
This excellence of the Nb/AIO,/Nb junctions is 
due to the following four properties. First is the 
stability of refractory Nb with its fine-grain, 
smooth surface. Second is the affinity of the 
thin Al to the underlying Nb. An Al layer a few 
nanometers thick can wet the Nb surface®!)~§4), 
Third is the integrity of the AlO, barrier formed 
on Al. The uniform AlO, 2 nm thick is repro- 
duced by introducing oxygen into a vacuum 
chamber. Fourth is that the AlO, barrier is not 
damaged by the subsequent deposition of the 
counter Nb electrode. In addition to these 
inherent material properties, the development of 
the whole wafer process®?)»55) is vital to en- 
hance junction process reliability. In this proc- 
ess, the Nb/AlO,/Nb multilayered structure is 
deposited in a vacuum run to eliminate contam- 
inants around the junction barrier. 

To reproduce high-quality Nb/AIO,/Nb 
junctions, we have optimized the various process 
parameters listed in Tables 3 and 4. In particu- 
lar, the deposition of the Nb/AIO,./Nb trilayer 
structure is essential to obtaining high-quality 
junctions. The depositing parameters of Nb and 
Al were observed to affect the junction charac- 
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BK 1. @hUw 


c) 3.3 Pa 


Fig. 4—SEM photographs of Nb films deposited at 
different Ar pressures. Film thickness is 600 nm. 


d) 4.0 Pa 


teristics. Reference to details on optimization is 
made in some papers**)®)-8”). The following 
subsections briefly describe two points affecting 
the junction characteristics: the quality of Nb 
electrode*®) and the coverage of thin Al on 
Nb®°*)87) The standard process for junctions 
and their characteristics is described below. 

3.2.2 Nb electrodes 

The Nb electrodes were deposited with dc 
and rf magnetron sputtering. A comparison of 
the junction characteristics concluded that junc- 
tions with dc-sputtered Nb are superior to those 
with rf-sputtered Nb**). This difference is due 
to the interdiffusion between the lower Nb and 
thin Al layers. Based on these results, our re- 
search has focused on optimizing the parameters 
of dce-sputtered Nb. 

The film characteristics of dc-sputtered Nb 
were studied in terms of surface morphology, 
stress, crystal structure, and superconductivity. 
These characteristics change depending on the 


Tensile 


Stress (x 10° dyn/cm?) 


Compressive 


0 1 2 3 4 5 
Ar pressure (Pa) 


Fig. 5—Nb film stress versus Ar pressure during sput- 
tering. 


200 


Line width (~7m) 


Ar pressure (Pa) 


Fig. 6—Critical current J, of Nb wirings versus Ar 
pressure. 


sputtering parameters, typically on the Ar pres- 
sure. 

Sputtered Nb film has a columnar structure 
with polycrystalline grains, where the Nb(110) 
plane is oriented in parallel to the substrate 
surface. Figure 4 compares scanning electron 
microscopy (SEM) photographs of Nb deposited 
at different Ar pressures. The surface morpholo- 
gy of sputtered Nb features a fine filamentous 
texture. At higher Ar pressures, however, the 
surface morphology deteriorates as small grains 
of increased density appear in the texture. 
Figure 5 shows the intrinsic stress of sputtered 
Nb changes as a function of Ar pressure. The 
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stress changes from compressive to tensile as the 
Ar pressure increases. At around 2.3 Pa, stress- 
free Nb is obtained. Figure 6 shows the super- 
conducting critical current of 600-nm Nb wiring 
as a function of the Ar pressure. The supercon- 
ductivity of Nb film gradually deteriorates above 
a threshold Ar pressure of 3 Pa. 

When considering the surface morpholgy 
and superconductivity of sputtered Nb, films 
deposited at low Ar pressures are regarded more 
suitable for junction electrodes. Based on these 
results, the Ar pressure for the standard process 
was fixed at 1.3 Pa, although it is a little com- 
pressive. For the characteristics of junctions 
larger than 1 wm, however, we observed no 
effects caused by the compressive stress of Nb 
film. To make junctions smaller than 1 um, 
the use of stress-free Nb was considered crucial 
to achieving high-guality junctions, as described 
in Sec. 4.1. 

3.2.3 Coverage of Al on Nb 

Since Al is several nanometers thick, its 
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Fig. 7—Al 2p spectra for air-oxidized Al/Nb/Si. 
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Fig. 8—Nb 3d spectra for air-oxidized Al/Nb/S: 
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coverage on the underlying Nb is critical to 
grow a uniform AIO, barrier without pinholes. 
The coverage of thin Al on Nb was studied by 
using X-ray photoelectron spectroscopy 
(XPS)°°87). Figure 7 compares the Al 2p 
spectra for Al/Nb/Si of different Al thicknesses. 
Spectra are normalized by the peak height of Al 
oxide. For Al thicker than 2.1 nm, a peak of 
elemental Al is clearly observed. Figure 8 
compares the Nb 3d spectra for the same sam- 
ples. For Al thicker than 2.1 nm, peaks of 
Nb,O, disappear, and only the elemental Nb 
peaks remain. Figure9 summarizes the film 
growth of thin Al on Nb based on the values 
above, with XPS studies on Al/Si samples also 
indicated. For Al thinner than 1.4 nm, the AI is 
completely oxidized, and underlying Nb is 
partly oxidized. For Al thicker than 2.8 nm, the 
Nb surface is compeletely covered by elemental 
Al. Consequently, the Nb is not oxidized. The 
Al formed on Nb should be thicker than 2.8 nm 


me : Nb,O, 
Cw 


Al thickness (nm) 
| i. i. i 


Al 


b) Al/Nb/Si 
Fig. 9—Film growth of Al deposited on Nb. 
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Substrate 


a) Deposition of trilayer d) Deposition of insulator 


———— 


b) Definition of 
junction area 


os ety 


f) Deposition and 
patterning of wiring 


e) Formation of contact hole 


c) Patterning of base electrode 


Fig. 1O—Nb/A1O,./Nb Josephson junction process 
se quence. 


to obtain a clean Al/Nb interface. Conversely, 
when Al is deposited on Si, Al thicker than 7 nm 
can cover the Si surface. This indicates the 
affinity of Al to Nb is much better than that to 
Si. This enables formation of a thin Al barrier on 
Nb. 

The I-V characteristics were examined for 
junctions of different Al thicknesses. Good 
characteristics were obtained for Al 2.8-10 nm 
thick. For Al thicker than 10 nm, the I-V char- 
acteristics gradually deteriorate due to the prox- 
imity effect on the Al/Nb interface. Based on 
these results, a standard Al thickness of 7 nm 
was established. 

3.2.4 Junction process 

Item a) to f) in Fig. 10 indicate the sequence 
of Nb/AI1O,./Nb junction fabrication. 

a) Deposition of trilayer 

Before trilayer (Nb/AlO,/Nb) deposition, 
substrates are etched by Ar sputtering to contact 
the Nb ground plane through contact holes. 
The trilayer is then deposited by sputtering on 
the substrates attached to a holder cooled by 
circulating water. To ensure good thermal con- 
tact, copper backing plates and indium foils are 
used as spacers between the substrates and hold- 
er. The base pressure is less than 2 x 1075 Pa. 
After the Nb base electrode and 7-nm AI barrier 


Ver. : 

0.2 mA/div 
Hor. : 

1 mV/div 


a) Single MVTL OR gate 


Ver. : 

0.1 mA/div 
Hor. : 
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b) 100 serially connected gates 


Fig. 11—I-V characteristics. 


are sequentially deposited, Ar+ 10%O, gas is 
introduced into the chamber to form a thin 
oxide 2nm thick on the Al. The junction’s 
critical current density (j,), depends on the 
pressure (usually 50-200 Pa) and oxidation 
time (typically 30-60 min). j, ranges of 
500-20000 A/cm*. When j, is designed at 
2000 A/cm?, the averages of j, range of 
1500-2500 A/cm? for different waters. After 
oxidation, the Nb counter electrode is deposited 
on the AIO, barrier. Next, the trilayer is 
trimmed into a device structure. 
b) Definition of junction area 

The junction area is defined by patterning 
the Nb counter electrode. In RIE, AlO, and Al 
act as good etching stops against CF, gas. The 
remaining AlO,-Al barrier is removed by Ar 
sputter etching. 
c) Patterning of base electrode 

The lower Nb is then etched into a base 
electrode pattern with CF,. 
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d) Deposition of insulator 

Junctions are covered with an SiO, insula- 
tion layer. Section 3.3 describes the sputtering 
of SiO,. 

e) Formation of contact holes 

Contact holes above the counter electrodes 
are formed in the SiO, by RIE with CHF;. 
f) Deposition and patterning of wiring 

Nb is deposited and patterned as a wiring 
layer. 

3.2.5 Junction characteristics 

The j, uniformity is sufficient for circuit 
application. The standard deviation of j, is 
about three percent for 2-um-square junctions 
on a chip. The variation of j, in different fabri- 
cated waters increases even in the same lot. 

Figure 11 shows the I-V characteristics of 
modified variable threshold logic (VMTL) 
gates’”). The gate junction is paired, 2.5 um 
and 4.04m in diameter. The gap voltage is 
2.9 mV. Quality parameter V,, exceeds 50 mV. 
The maximum-to-minimum spread of the critical 
current is +5% for 100 gates. 

Nb/AIO,/Nb junctions are very stable in 
thermal cycles between 4.2 K and 300 K, and in 
storage at room temperature. The I-V charac- 
teristics do not change during the first few 
years of storage, but are affected by annealing. 
Figure 12 shows changes in j, and V,, after 
60 minutes of annealing in nitrogen at different 
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Fig. 12—Deterioration of Josephson junctions after 
annealing. 
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temperatures. The junction starts deteriorating 
with annealing above 200°C. Critical current 
density begins decreasing after annealing at 
175 °C. These changes are considered caused 
by the change in AIO, composition and/or 
the interdiffusion of Al and Nb>4)’8%): 89, 

Deterioration due to annealing is one of the 
most critical problems regarding the reliability 
of Nb/AlO,/Nb junctions. This makes it diffi- 
cult to further develop Josephson circuit tech- 
nology for two reasons. One is that the margin 
in process temperature is not very large; 175 °C 
is only 55°C higher than the typical resist 
baking temperature. The other is that the 
annealing characteristic limits the application 
of various semiconductor process techniques. 

Thus, the annealing characteristics should 
be improved for further development in 
Josephson circuit technology®”. 


3.3 SiO» insulators 

In lead-alloy Josephson circuits, SiO or SiO, 
films were used for insulation layers?) , but the 
integrity of such films is not sufficient for Nb 
circuits. Figure 13 shows the breakdown voltage 
in SiO and si0,™. The insulator is 300 nm 
thick, and the electrode is 4 mm square. Using 
sputtered Nb for the upper electrode reduces the 
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0 100 200 
Breakdown voltage (V) 


Fig. 13—Breakdown voltage in 300-nm SiO and SiO, 
films. 


T. Imamura: Josephson Integrated Circuits I: Fabrication Technology 


breakdown voltage to almost zero due to micro- 
cracks induced by heat or stress during refrac- 
tory Nb deposition. 

Sputtered SiOz was used as an insulator in 
Nb/AIO,./Nb Josephson  circuits””»°°?, The 
breakdown voltage in sputtered SiOz (see 
Fig. 14) exceeds 200 V even when Nb is used for 
the upper electrode. When Nb is used for both 
upper and lower electrodes, the breakdown 
voltage of the sandwiched SiO, is reduced to 
one third the above value, though still exceeding 


Si substrate 
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0 100 200 300 
Breakdown voltage (V) 


Fig. 14—Breakdown voltage in 300-nm sputtered SiO, 
films. 
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Fig. 15—I-V characteristics and cross sections for 
junctions. 


10 


50 V. This is high enough for use in practical 
Josephson integrated circuits, where the junc- 
tion voltage is only 3 mV. The deterioration of 
SiO, deposited on Nb is due to the differences 
in the initial SiO film growth between on Si 
and on Nb. 

The Nb/AIO, /Nb junction deteriorates when 
annealed at temperatures exceeding 200 °C. 
When depositing SiO, on the junctions, 
thoroughly cooling the substrates during SiO, 
sputtering becomes critical for protecting the 
junctions against thermal damage. For this 
purpose, the deposition rate of SiO. was reduced 
to 8nm/min. During sputtering, the substrates 
were attached to a water-cooled holder with 
indium foils. Through these measures, the 
application of SiO, in the junction process 
was found to cause no deterioration in the junc- 
tion characteristics. Figure 15 compares junc- 
tions with and without an 800-nm SiO, insula- 
tion layer (3). The junction quality (Vp) 
exceeds 40 mV for both junctions. No deteriora- 
tion in the characteristics were observed after 
depositing a thick layer of SiO, on the junc- 
tions. Based on these results, sputtered SiO, has 
been applied in the standard Josephson circuit 
process. The use of sputtered SiO, resulted in a 
significantly improved production yield for 
Josephson circuits. This was demonstrated in the 
yield evaluation for 8K-bit cell array chips?” , as 
described in Sec. 3.7. 


150 


Line width (4m) 


100 


I, (mA) 


50 


0 200 400 600 800 


Film thickness (nm) 


Fig. 16—Critical current J, of Nb wiring lines wo 
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3.4 Nb wiring 

In Josephson circuits, superconducting Nb is 
used for the wiring. As circuits become more 
densely integrated, long, narrow wiring lines 
are needed for circuit construction. Figure 167” 
shows the superconducting critical current 
(J.) measured for Nb wiring. The lines are 
1.0-2.0 um wide and 10.88 mm long. For films 
thicker than 400 nm, /, only depends on line 
width, not film thickness. For films thinner than 
200 nm, however, /, changes linearly with the 
thickness, apparently due to supercurrent flow- 
ing in the film surface up to the London 
penetration depth. 
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Fig. 17—J/, of wirings versus the thickness of underlaying 
Nb base electrode, 
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Fig. 18—Changes in /, of wiring after annealing. 
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The /, of Nb wiring is much greater than the 
current level of Josephson gates (typically less 
than 1 mA). In actual circuits, however, Nb 
wiring should run over steps of underlying 
layers. The J, decreases steeply with the thick- 
ness of underlying layers (see Fig. 17). This is 
due to an effectively decreased film thickness 
at steps. Thus, the J, of wiring at steps will be 
enhanced by improving the Nb coverage at steps. 
This was actually verified by applying bias- 
sputtered Nb to wiring as described in Sec. 4.4. 

Another problem posed by Nb wiring is its 
annealing characteristics*®). The I, of wiring 
decreases with the annealing temperature as 
shown in Fig. 18, even when annealed in nitro- 
gen. The reduction in J, was observed to be 


closely related to the oxygen diffusion in 
Nb®5)-89 


3.5 Contacts 

Josephson circuits require contacts between 
two Nb electrodes. With increased circuit inte- 
gration, smaller contacts with higher critical 
currents are needed. Contacts are made through 
holes formed in the insulating SiO, layers. The 
critical current of 200 serially connected con- 
tacts is plotted in terms of the contact area 
shown in Fig. 197”). Before Nb wiring deposi- 
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Fig. 19—Critical current /, in contacts formed between 
77 
Nb electrodes 
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Fig. 20—Sheet resistance in Mo resistors. 


tion, the Nb base electrode is cleaned by Ar 
sputtering at 1.3 Pa for one to three minutes to 
remove the native oxide formed on Nb. The 
sputtering power density was 1.9 W/cm?. The 
base electrode is 200 nm thick and the wiring 
layer 800 nm thick. The critical current for 
contacts changes almost linearly with the 
contact area. Even for small contacts 1.0 um 
square, the 20-mA critical current is much larger 
than the current flow in actual circuits. Also 
note that such contacts face the same problem 
as the wiring: /, deterioration due to annealing. 
Section 4.4 describes how the annealing stabili- 
ty of contacts has been improved. 


3.6 Mo resistors 

Mo films deposited by sputtering form resis- 
tors. Figure 207’) shows the measured sheet 
resistance (R,) plotted against thickness (qd). 
Rs, is proportional to d~'*®. The spread of 
resistance on a wafer is as small as several 
percent, and the contact resistance between Mo 
and Nb is negligible. 

The Mo resistor requires a protective layer 
of SiO,, which is patterned by lift-off. The 
damping resistor used in the gates makes it 
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Fig. 21—Cell layout for the 8K-bit memory cell array 
chip. The unit cell is indicated by a bold rectan- 
gle. 


difficult to reduce the gate size. Because the 
SiO, lift-off enforces a rather large patterning 
margin. Thus we are developing new resistor 
material to make the protective layer of SiO, 
unnecessary. 


3.7 Process evaluation 

The previous sections described the inde- 
pendent evaluation of the elements composing 
Josephson circuits. In actual circuits, however, 
such evaluations are not independent, but affect 
each other in complex ways. Thus, it is neces- 
sary to evaluate the characteristics of each ele- 
ment in full vertical structures, including the 
layers from the ground plane to the control lines. 
For this purpose, we have evaluated 8K-bit 
memory cell array chips from the early stages of 
our Josephson research?). The junctions tested 
thus far are lead-alloy (Pb-Bi/Pb-In-Au) junc- 
tions, Pb-Bi/Nb junctions, and Nb/AIO, /Nb 
junctions???) , 

The tested cell is a 2-junction interferometer 
with three control lines (see Fig. 21). The junc- 
tion size is 5 x 14 um? and the minimum line 
width is 2 wm. Figure 22 shows the fabricated 
5x5mm? cell array chip containing 8 192 
cells and 87 pads. The fabricated chips were 
tested at two stages. First, the chip yield was 
measured at room temperature by using an 
automatic prober. Table 5 lists the test items. 
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1mm 
Fig. 22—Microphotograph of the fabricated 8K-bit 
memory cell array. 


Table 5. Tested items in the prober test at room 


temperature 
Combinations 
Item Content af pads 
Aas SA | SQUID array 36 
a cl CL | Control line 32 
Total : 68 
SG _ | SQUID array-GP 20 
CC | CL;-CL; 20 
I lati 
ae SC | SQUID array-CL; 48 
CG | CL;-GP 15 


The continuity of the interconnecting wiring 
was checked for 68 combinations of selected 
pads, and the insulation of the insulating layers 
was checked for 100 combinations. Secondly, 
the I-V characteristics were measured for the 
chips selected through room-temperature testing. 
The yield of cells without shorted junctions was 
evaluated, as well as variations in the cells’ /,. 
Table 6 lists the yields for checks made dur- 
ing the room-temperature testing. By using SiO, , 
the yields are significantly improved. Figure 23 
shows the yield of the insulation check plotted 
in terms of the area of the insulators sandwiched 
by two electrodes. Plots of SiO, were obtained 
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Fig. 23 —Yield of the insulation checks versus the area of 
the insulators. 


Table 6. Yields for the tested items 
Yield (%) 
sa | cL | sc |cc | sc | cG 


Insu- 
lator 


Junction 


SiO 
Nb/AIO,/Nb 


Table 7. Yields for combinations of the tested items 


; Yield (%) No. of 
Junction Insulator SA*  CL* | perfect chips 
Pb-Bi/Pb-In-Au _SiO,. 0 
Pb-Bi/Nb SiOx 0 


Nb/A10,./Nb SiO, 


for the Nb/AIO,/Nb junctions, and plots of 
SiO, and SiO were obtained for the Pb-Bi/ 
Pb-In-Au junctions. The defect density (D) of 
the insulators was obtained from the slope of 
plots: 0.06 mm~ for SiO,, 0.3 mm for SiO,, 
and 1.0 mm~? for SiO. These results are consis- 
tent with the breakdown voltages for SiO, , SiO, 
and SiO. 

The total yield for the SQUID array and 
control line (SA* and CL*) are calculated by the 
yield obtained for related checks; 

SA* = SA x SG x SC, 

CL* =€L* CC x SC x SG. 
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Table 8. Failure rates of the Nb/ AlO,./Nb cells 


; No. of Failure rate 
Chip No. | No. of cells shorted calis (%) 
1 | 8192 2 0.024 
2 8192 0 0 
3 8192 4 0.048 
4 8192 0 0 
5 8 192 0 0 
6 8 192 2 0.024 
Total 49 152 8 0.016 


Table 9. Average cell failure rates for three kinds of the 
junction material 


Functions No. of cells No. of Failure rate 
measured |shorted cells (%) 
Pb-Bi/Pb-In-Au 24 576 260 1.06 
Pb-Bi/Nb 65 536 83 0.13 
Nb/AIO,./Nb 49 152 8 0.016 


Table 7 lists SA* and CL*, along with the 
results for the Pb-Bi/Pb-In-Au and Pb-In-Au/Nb 
junctions. The yield for Nb/AlO,/Nb junctions 
proved sufficiently high to obtain six perfect 
chips among the 64 tested. 

The low-temperature characteristics were 
measured for the selected six chips. Eight cells 
were observed to be shorted in 49 152 cells (see 
Table 8). Thus, three of the six chips had no cell 
failure. The average cell failure rate was 0.016 
percent. From these results, we estimated a prac- 
tical yield of 27 percent for the 8K-bit cells. 

Table 9 compares the cell failure rates for 
the three kinds of junction materials. The use of 
refractory Nb for the base and counter elec- 
trodes decreases the failure rate by one order. 

Figure 24 shows the /, distribution of 8 192 
cells. All cells are within the range of +13%, and 
the standard deviation of /, distribution is 3.3 
percent. 

The evaluation of the 8K-bit cell array chips 
demonstrated that our standard process is feasi- 
ble to Josephson integrated circuits including 
several thousand junctions. 


4. Advanced processes 
The standard process described in the 
previous chapter enables high-speed  opera- 
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Fig. 24—/, distribution for 8 192 cells on a chip. 


3.2 3.4 


tion of Josephson logic and memory circuits 
described in Part II'®’ to be verified. Josephson 
device potential is very high, however, current 
circuit design and process techniques remain 
relatively primitive when compared to semi- 
conductor devices. To keep pace with semicon- 
ductor device development, Josephson circuit 
process technology must be further developed to 
achieve higher circuit density and enhance 
process reliability, for example. Four advanced 
process techniques have been recently 
developed: the fabrication of submicrometer 
junctions”? three-dimensional Josephson cir- 
cuits?) anodization to characterize the 
thin tunneling barrier™?” , and bias-sputtered 
Nb applied to wiring” The microstructure of 
Nb/AIO, /Nb junctions was also studied by using 


transmission electron microscopy (TEM). 


4.1 Submicrometer junction 

Figure 25 compares the cross section of the 
conventional junction with a new junction struc- 
ture developed through miniaturization®”® . A 
contact hole in the insulator is formed above the 
junction in the conventional structure. A sub- 
micron junction is difficult to make because the 
contact hole size must be smaller than the 
junction size. In the new structure, the junction 
area is defined by anodizing the Nb counter 
electrode. The anodized film thickness is in-situ 
monitored by the anodization profile methods?” 
described in Sec. 4.3. The anodized Nb, also 
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Substrate \ ; 
AlO, barrier 


a) Conventional process 


b) Self-aligned process 


Fig. 25—Josephson junction. 


Ver. : 20 wA/div Hor.: 1 mV/div 


Fig. 26—I-V characteristics in the 0.7-um-sequare 
Nb/A1O,/Nb junction. 


acting as an insulator between the base electrode 
and wiring, enables formation of self-aligned 
contact holes. Thus, a small junction can be 
defined independently of the contact hole size. 

Figure 26 shows an I-V characteristic for a 
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Fig. 27—Linearity of the tunneling current and the junc- 
tion area. /yn is the tunneling current at 
4 mV, and D is the side length of square junc- 
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0.7-um-square junction. The junction has one 
excellent characteristic: V,;, exceeds 30 mV. 
Here, electron beam lithography is used to make 
the resist pattern defining the junction area. 
The tunneling current at 4 mV is plotted against 
the junction size shown in Fig. 27. Linearity is 
observed for junction sizes of 0.7-5 wm. Pattern 
deviation in the junction size is estimated at 
only 0.02 ym. 

To achieve high-quality submicron junctions, 
another technical problem had to be solved — 
the intrinsic stress in Nb film’®”. Sputtered Nb 
film reportedly contains stress apparently related 


to the junction characteristics’°”. Such stress 
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varies with the Ar pressure during sputtering (see 
Fig. 5). Stress-free Nb attained by optimizing 
the Ar pressure was used for junction electrodes 
to make submicrometer junctions”. The high- 
quality characteristics are due to the use of 
stress-free Nb electrodes. 

The stress observed was related to the micro- 
scopic crystal structure of sputtered Nb. The 
lattice constant of Nb is plotted against the film 
stress in Fig. 28'. The linear relationship in- 
dicates that the crystal structure of Nb is dis- 
torted in proportion to the stress. How Nb 
stress deteriorates smalljunction quality is not 
yet clear, but stress relaxation was observed 
during Nb film patterning!™ | As shown 
in Fig. 29, the lattice constant of compressive 
and tensile Nb clearly changes when the film is 
patterned, and shifts more markedly as the 
pattern size decreases. In contrast, stress-free Nb 
exhibits no change. These results suggest that 
film stress relaxes at the periphery in patterned 
Nb, thus adversely affecting small-junction 
quality. 


4.2 Planarization and three-dimensional circuits 
As mentioned before, the sputtered SiO, 
insulator helps improve the production yield of 
Josephson circuits. But, SiO, integrity is not yet 
sufficient to increase circuit integration to the 


Ar pressure (Pa) 


0.332 


0.330 


0.328 


Lattice constant (nm) 


0.326 
] 10 100 


Width of lines and spaces (4~m) 


Fig. 29—Changes in the Nb lattice constant during 
patterning (Hollow symbols: before patterning, 
solid symbols; after patterning). 
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VLSI level. This is mainly due to the poor step 
coverage of SiOz deposited on the patterned Nb. 
Planarizing the step of the underlying layer 
must be developed. One possiblity is to use bias- 
sputtered SiO, 103) Figure 30 compares the 
breakdown voltages of sputtered and _bias- 
sputtered SiO, deposited on the patterned 
Nb?). The breakdown voltage, increased by a 
factor of 2 or 3, indicates the improved step 
coverage of bias-sputtered SiOz. The production 


Sputtered SiO, 


Bias-sputtered SiO, 


Frequency 


0 100 200 300 
Breakdown voltage (V) 
Fig. 30—Breakdown voltages of sputtered and bias 
sputtered SiO, films deposited on patterned 
Nb films. 


oct " —1 


a) Sputtered SiO, b) Bias-sputtered SiO, 


Fig. 31—SEM photographs of junctions. 
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yield was verified as being improved by changing Figure 31 shows SEM photographs of 
the insulator from sputtered to bias-sputtered junctions made with sputtered SiO, and bias- 
SiO). sputtered SiOz; the steps of underlying Nb 


Ver.: 2mA/div Hor.: 1mV/div Ver.: 1mA/div Hor.: 10 mV/div 
a) Single junction b) 20-junction chain 


Fig. 32—I-V characteristics of junctions planarized with bias-sputtered SiO, . 


Ver. : 0.2 mA/div 
Hor.: 1mV/div 


Upper-level junction 


a) I-V characteristics 


Upper-level junction 


Lower-level junction 


Cross section SEM photograph 
b) Stacked junctions 


lum 


Fig. 33 —Vertically stacked junctions. 
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electrodes are planarized to obtain a smoother 
device surface. Figure 32 shows the I-V charac- 
teristics of the junctions planarized with bias- 
sputtered SiO,. Although © bias-sputtering 
accompanies additional ion bombardment, no 
deterioration was observed in the I-V charac- 
teristics. 

We tried to stack other junctions above the 
junctions planarized by using bias-sputtered 


ne 


Fig. 34—Three-dimensional 10-Kgate array chip, 7.8 mm 
square. A 5 292 gates are integrated in both 
upper- and lower-level circuits. 


Junction 


Si substrate 


SiO, a Figure 33 shows a cross section of the 
vertically stacked junction, and the I-V charac- 
teristics for the upper-level and the lower-level 
junctions. By using 600-nm bias-sputtered SiO2, 
high-quality characteristics could be obtained, 
even for the upper-level junctions. 

Planarization increases the possibilities of 
developing three-dimensional Josephson  cir- 
cuits!”. A gate array including 10 584 gates was 
fabricated as a test device for the first three- 
dimensional Josephson circuits (see Fig. 34). 
The upper-level and lower-level circuits both 
contain 5-K Josephson gates. Figure 35 shows an 
SEM photograph of the vertically stacked gate 
array. A common ground plane is sandwiched 
between the lower- and upper-level circuits. 
Lower-level gates are set upside down to the 
upper-level gates. The high quality of the 
252-gate chains was verified for both the upper- 
and lower-level gates (see Fig. 36). 


4.3 Junction barrier diagnosis 

Josephson junction characteristics largely 
depend on the formation of a uniform, high- 
quality tunneling barrier. Since the Josephson 
effect only appears at such low temperatures as 
4.2 K, junctions cannot really be evaluated until 
processing is completed. It is also difficult to 
diagnose the thin AlO,-Al barrier using conven- 
tional analytical methods because the barrier is 
very thin and sandwiched between two Nb elec- 
trodes. These are serious problems hindering the 


Junction 


GP 


Fig. 35—SEM photograph of a vertically stacked gate array. 
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Si substrate 


a) Upper-level circuit 


Si substrate 


Ver. : 0.5mA/div Hor.: 100 mV/div 


b) Lower-level circuit 


Fig.36—Wiring diagram and I-V characteristics for 252-gate chain. 


dV/dt (V/s) 


Upper Nb Lower Nb 


>< —> 
AlO,-Al 


0 20 40 60 80 
Voltage (V) 


Fig. 37—Anodization profile for a Nb/AlO,/Nb junc- 
tion. The upper Nb and AI layers are 30 nm and 
7 nm thick. 


development of improved Josephson process 
techniques. 

An anodization profile proposed for 
evaluating multilayered structures was applied to 
Nb/AIO,./Nb: junction structures”?’?”?. Many 
junction structures were studied by using the 
anodization profiles. Figure 37 shows an ano- 
dization profile for the Nb/AlO,/Nb junction. 
The anodization proceeds in proportion to the 


104) 
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dV/dt (V/s) 


0 20 40 60 80 
Voltage (V) 


Fig. 38—Anodization profile for a Nb/Al/Nb structure. 


voltage: 0.85 nm/V for Nb and 0.88 nm/V for 
Al. From the profile, the sharpness of interfaces 
in multilayered structures can be estimated with- 
in an accuracy of 0.5 nm. Such a precise analysis 
is not possible by using conventional methods of 
analysis. 

Figure 38 shows a profile of the Nb/Al/Nb 
structure. Without the .AlO,. , diffusion between 
the upper Nb and Al is clearly observed. As 
evident in Figs.37 and 38, AlO, prevents the 
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Annealing 
temperature (°C) 
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dV/dt (V/s) 
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Fig. 39—Anodization profiles of Nb/AlO,/Nb junctions 
annealed for 30 min. 


Al layer (nm) 


0 20 40) 60 80 
Voltage (V) 


Fig. 41—Anodization profiles of Nb/AlO, /Nb junctions 
with five different Al thicknesses. The upper 
and lower Nb are 30 nm and 15 nm thick. 


diffusion of Al and the upper Nb. In other 
words, the AIO, barrier is not damaged during 
deposition of the upper Nb. 

Figure 39 compares the anodization profiles 
for annealed junctions. By annealing, two 
changes occur in the profiles. One is the diffu- 
sion between Al and the lower Nb. The other is 
the increased peak height of AlO,. Figure 40 
shows the I-V characteristics of annealed junc- 
tions. Changes in the characteristics feature a re- 
duced /, and increased leakage current. Both 
changes are related to changes in the anodization 
profiles. 

Figure 41 compares the anodization profiles 
with Al from 1.4 nm to 7.0 nm thick. Al barriers 
as small as a few nanometers thick can definitely 
be observed. From the barrier anodization volt- 
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1 mV/div 


Ver. : 

0.01 mA/div 
Hor. : 

1 mV/div 


Ver. : 
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Fig. 40—I-V characteristics of junctions. 


age width (Vg), we can calculate the thickness 
of Al to be oxidized into AIO, as about 
1 nm?°5), 

The anodization profile has been verified as 
an effective tool for accurately analyzing junc- 
tion barriers with high precision. 


4.4 Bias-sputtered Nb for wiring 
Using Nb wiring in Josephson circuits poses 
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a) Bias voltage: 0 V 


* 


r - 
a ~ a 
4 


tet 


c) —200 V d) —300 V 


Fig. 42—Cross-sectional SEM photographs of Nb depos- 
ited on SiO, steps. Nb and SiO, are 500 nm 


thick. 


7 re Tere : 500 nm 
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0 —100 —200 — 300 
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Fig. 43 —Critical current /, of bias-sputtered Nb films. 
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Contact hole 300 500 
size (um) 


I, (mA) 


0 —100 —200 
Bias voltage (V) 
Fig. 44—Critical current J, of contacts versus bias 
voltage during Nb sputtering. 


two problems: reduced /, of wiring at steps and 
after annealing. This is due to the effectively 
reduced thickness of Nb wiring running over the 
steps. To enhance Nb wiring reliability, we devel- 
oped bias-sputtered Nb for wiring in circuits”® . 
The step coverage of Nb is improved by adding 
the bias voltage during sputtering (see Fig. 42). 
Figure 43 shows the superconducting critical 
current (J,) of bias-sputtered Nb film. For bias 
voltage less than —150 V, the deterioration in 
superconducting characteristics is negligible. 
Based on these rusults, bias-sputtered Nb was 
applied to the wiring used in Josephson circuits. 
We confirmed that the Nb/AlO,/Nb junction 
was not affected by using the bias-sputtered Nb. 
The characteristics of contacts were also im- 
proved before and after annealing. The wiring 
running over the steps also showed improvement 
in terms of their superconducting characteristics. 

Figure 44 shows the /, of contacts as a func- 
tion of the bias voltage during wiring sputtering. 
The Nb base electrode is 200 nm thick, the SiO 
insulator is 300 nm thick. and the Nb wiring is 
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Wiring: 500 nm 
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Fig. 45—Changes in /, of contact after annealing. 
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Fig. 46—Lattice image of Nb/AlO,/Nb junction 
structure. 


300 nm to 500 nm thick. By applying bias volt- 
age, J, is increased by tactors of 2 to 10. In 
addition, the contacts with bias-sputtered Nb 
were observed to be stable against annealing. 
Figure 45 shows changes in /, after annealing. 
The /, decreases with the annealing temperature. 
The contacts with bias-sputtered Nb, however, 
are much more stable than those with sputtered 
Nb. These improvements in contact characteris- 
tics before and after annealing are due to the 
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effectively increased Nb thickness at the steps. 
Thus, the bias-sputtered Nb offers promise for 
wiring use in Josephson LSI application. 
4.5 Cross-sectional TEM observation of junction 

structures 

The microstructures of the Nb/AIO,/Nb' 
junction were studied through cross-sectional 
TEM observation”. Figure 46 shows the lat- 
tice image of the Nb/AlO,/Nb junction struc- 
ture. The upper and lower Nb are 20-200 nm, 
with Al 14nm thick. From the lattice image, 
two interesting pieces of information were ob- 
tained. One is the crystallization of Nb and Al 
(thin Al as well as Nb are polycrystalline films 
with columnar structures). Most grains are ori- 
ented to a certain plane; Nb(110) planes are 
parallel to the substrate. The orientation of 
Nb(110) is consistent with the results obtained 
by X-ray diffraction. Al is also oriented to the 
(111) plane. Thus, the crystalline Al grown 
on Nb is considered critical to producing high- 
quality junctions. Also note that the upper Nb 
features a good crystalline structure just above 
the AlO, barrier. This indicates that diffusion 
did not occur between the upper Nb and AIO, 
barrier. 

The other piece of information obtained 
concerns the Al and Nb interfaces. The interface 
between the lower Nb and AI is rough and wavy 
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due to the rough surface of the lower Nb. The 
other interface between the upper Nb and AlO, 
is sharp and clear. This indicates that AI planar- 
izes the lower Nb surface. Consequently, the 
planarized Al surface is considered essential to 
growing an AIO, barrier without pinholes. 

The TEM studies indicated that a material 
combination of Nb, Al and AlOx is nearly ideal 
for the Josephson junctions. 


5. Conclusion 

This paper reviewed the fabrication technol- 
ogy developed in our laboratories for Josephson 
integrated circuits, and described the techniques 
used in the standard process for various circuits 
elements, Nb/AlO,/Nb junctions, SiO, insula- 
tors, Nb wiring, contacts, and Mo resistors. The 
key process to producing Josephson circuits is 
obtaining reliable and high-quality junctions. 
The characteristics of Nb/AlO,/Nb junctions 
were found to satisfy these requirements almost 
perfectly. Therefore, a combination of Nb, Al 
and AlO, is deemed the best one for Josephson 
junctions. This paper also described how process 
techniques were verified as being practical for 
Josephson circuits that include several thousand 
junctions, based on the characteristics of the 
8K-bit cell array chips fabricated for process 
evaluation. Without the process technology 
based on the reliable Nb/AIO,/Nb junctions, the 
ultra-high-speed operations recently achieved in 
Josephson integrated circuits would not have 
been possible. This paper also introduced 
the advanced process technology developed for 
future Josephson LSI application. These tech- 
niques satisfy demanding requirements to 
enhance circuit integration and improve process 
reliability even more. 

Existing Josephson process technology was 
developed by several researchers. Although this 
technology may appear primitive when com- 
pared to semiconductor technology, the high- 
speed and low-power operation made possible 
when using Josephson logic and memory circuits 
are not possible with semiconductors. This 
verifies the potential application of Nb/AIO, /Nb 
Josephson junctions as digital devices in fu- 
ture high-speed computers. Based on_ the 
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Nb/AIO,/Nb junction technology, Josephson 
computer development may now be seriously 
undertaken. To achieve this lofty goal, many 
technical breakthroughs in process technology 
must be made, particularly in terms of reliability 
and uniformity. Perhaps a greater research effort 
should be initiated in the field of superconduc- 
tive electronics. 
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Josephson junctions with Nb/AIO,/Nb structures have made it possible to fabricate a 
variety of high-speed circuits, including an 8-bit digital signal processor and 4K-bit memory. 
These circuits operate at ultrafast speeds and consume less power than any high-speed 
semiconductor circuits. An interface circuit which issues the signal from Josephson circuits 
to semiconductor circuits has also been fabricated. In short, it is possible to fabricate all 
components necessary for constructing a Josephson computer. Now the high-speed opera- 
tion of the Josephson computer must be demonstrated. 


4),5) 


1. Introduction operates on a clock frequency above | GHz 


The development of the niobium junction 
(Nb/AIO,. /Nb) for Josephson integrated circuits 
has changed what appeared to be a closed future 
to one with great potential. As described in the 
previous part of this paper!?, niobium junctions 
have solved most problems caused by the poor 
characteristics of lead-alloy junctions??. The 
niobium junction is very stable against thermal 
cycling and long-term storage. The scattering of 
critical current is quite small, and various 
circuits have been made possible with niobium 
junctions. These circuits operate much faster 
than those using lead-alloy junctions. 

When using lead-alloy junctions, critical 
current was widely scattered and the integrated 
circuit speed was limited to that of the slowest 
junction. Josephson integrated circuits are 
inherently faster than semiconductor devices, 
although actually fabricated circuits were not 
as fast as expected. The inherent nature of 
Josephson junctions could not be fully utilized 
with lead-alloy junctions. The uniformity of 
niobium junctions has, however, made it possible 
to realize the potential performance capabilities 
of Josephson junctions, such as the fastest gate 
operating at a switching time of 1.5 ps/gate?. 
An integrated circuit with a few thousand gates 
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High-speed logic circuits with a few thousand 
gates and 4K-bit memory”? have been developed. 

This paper describes the recent progress 
made in Josephson digital circuit development at 
our laboratories, using Nb/AlO,/Nb junctions. 
Chapter 2 describes the principles of circuit 
operation. Chapter 3 introduces the high-speed 
gate family based on modified variable threshold 
logic (MVTL)”. Chapter 4 covers the key tech- 
niques for fabricating high-speed circuits. 
Chapter 5 details a variety of high-speed logic 
circuits. Chapter6 introduces the memory 
circuits. Chapter 7 describes the interfacing 
between Josephson and semiconductor circuits. 
Chapter 8 describes the progress being made in 
Josephson computer development. In conclu- 
sion, chapter 9 summarizes this paper. 


2. Basic principles of Josephson digital circuits 

The main features of the Josephson junction 
are high-speed switching (1-10 ps/gate), low- 
power consumption (1-10 wW/gate), and low-dis- 
persion signal transmission (at about 100 um/ps). 
Figure | shows the relationship between the 
switching time and power consumption for 
different high-speed logic gates, clearly indicat- 
ing the high-speed, low-power capability of the 
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Fig. 1—Relationship of switching time and power 
consumption for different high-speed logic gates. 
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Fig. 2—Equivalent circuits. 


Josephson logic gates. Due to these attractive 
features, Josephson junctions are expected to be 
used in ultrahigh-speed computers. 

The Josephson logic gate operates different- 
ly than a semiconductor gate. The most impor- 
tant difference is that the Josephson gate 
switches in latching mode, while the semicon- 
ductor gate only changes its state when an input 
signal is applied. The Josephson gate, however, 
does not return to its initial state after the input 
signal is turned off, and is supplied by pulses 
rather than the dc power, with the frequency of 
the pulses determining the clock frequency. The 
supply current flowing through the gate is called 
bias current. 

Josephson logic gates are classified into 
magnetic-coupling and current-injection. Figure 2 
shows typical examples. Figure 3 shows the 
threshold characteristics of these gates. Figure 
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Fig. 3—Threshold characteristics. 
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Fig. 4—Computer simulated switching operation of 
Josephson logic gate. 


2a) shows the equivalent circuit of a magnetic- 
coupling gate with two junctions. Input signal 
current is applied to superconducting inductance 
coupled to another inductance loop in which 
two junctions are connected. The magnetic 
field caused by input signal current reduces the 
critical current of the gate | see Fig. 3a)! . When 
the bias current exceeds the reduced critical 
current, the gate switches from the supercon- 
ducting state to the voltage state. Figure 2 b) 
shows the equivalent circuit of a current-injection 
gate. Input signal current is applied to the gate 
junction. The input current causes the gate 
junction to switch to the voltage state | see 
Fig. 3b)} , but the gate junction cannot isolate 
the input signal current from that output. Thus, 
an additional junction (isolation junction) and 
a resistor (isolation resistor) are connected to 
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the gate junction to prevent the input signal 
current from flowing to the output terminal. 

Figure 4 shows computer-simulated wave- 
forms of Josephson logic gate operation for the 
magnetic-coupling gate. Figure 4a) shows the 
operating point movement regarding the I-V 
characteristics. When bias current is applied to 
the gate, the operating point moves from origin 
O to point A, where the gate enters the standby 
state for switching to voltage state B. 

Figure 4b) shows the waveform of bias 
current Jp, 4c) input signal current /;,, and 4d) 
junction voltage Vy. When J/g is smaller than 
critical current Jc, the junction is in the zero 
voltage state (at operating point A). When input 
signal current Jj, is applied to the gate, it gener- 
ates a magnetic field that couples to the gate 
inductance and reduces Jc. When Jc becomes 
less than Jp, the gate switches to the voltage 
state (operating point B). After the switching to 
the voltage state, the operating point stays at B, 
even if Jj, is removed. The gate can only be reset 
by decreasing Jp. The operating point goes from 
O to A to B to O. When the input signal is zero, 
Ic does not decrease below Jp, so nothing 
happens at A. O is only returned to when Jp is 
turned off. Consequently, the operating point 


a) Magnetic-coupling gate 


b) Current-injection gate 


Fig. 5—Logic circuits. 
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moves simply from O to A to O. 

Figure 5 shows logic circuits consisting of 
magnetic-coupling and current-injection gates. 
The main difference between these circuits is in 
the fan-out signal coupling. In the magnetic- 
coupling gate, the output signal is issued serially 
to multiple gates at subsequent stages. The 
current-injection gate applied its output signals 
in parallel. 

Different types of Josephson logic gates have 
been proposed (see Fig. 6)*®??). Some gates 
have been replaced by newly developed and 
improved gates. Josephson interferometer logic 
(JIL), Josephson threshold logic (JHTL)!??, 
4junction logic (4JL)'®, resistor coupled 
Josephson logic (RCJL) , and modified variable 
threshold logic (MVTL)??? gates are now being 
used in high-speed circuits. Among these, the 
fastest switching speed (1.5 ps/gate) has been 
recorded with the MVTL OR gate? as described 
in the next chapter. 


3. MVTL gate family?” >” 

Josephson logic circuits usually consist of 
OR and AND gates operated in a dual-rail 
configuration!» because an inverter cannot 
be constructed without a timing signal. The 
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Fig. 6—Classification of Josephson logic gates. 
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AND gate is usually driven by the output signals 
of OR gates because the AND gate cannot 
isolate the output signal from the input signal by 
itself. The MVTL gate family consists of OR, 
AND, and 2/3 majority (MJ) gates. The timed 
inverter (TI) is sometimes combined with an OR 
gate and another single junction. This chapter 
discusses the members of the MVTL gate family. 


Bias current 


Output current 


Input current 


a) Basic structure for the MVTL OR gate, whose 
input current both couples to the interferometer 
magnetically and it is injected. 
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b) Gate with a magnetic coupling only 
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Fig. 7—Equivalent circuits for three coupling structures. 
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3.1 MVTL OR gate 

The MVTL OR gate is the most important 
member of the Josephson logic gate family 
because it operates both as an OR gate and an 
isolator between input and output. The TI 
structure also includes an OR gate. 

Figure 7a) shows the equivalent circuit of 
the basic MVTL OR gate. It is an asymmetric 
interferometer or 2-junction superconducting 
quantum interference device (SQUID), con- 
sisting of two Josephson junctions J; and Jz and 
inductance L. The critical current of J; is p/m 
and that of Jz is gly, where p + q = 1 and J, is 
the maximum supercurrent of the interfero- 
meter. Inductance L is magnetically coupled to 
control line inductance ZL, through mutual 
inductance M. Bias current Jpg is applied to the 
point between left branch inductance gL and 
right-branch inductance pL, thus ensuring that 
the upper limit of the bias current is at its 
maximum, /,. The input signal current is fed 
through inductance L, and applied to the inter- 
ferometer. Thus, the signal current flows through 
magnetically coupled control line L, and is 
applied to the interferometer. Figure 8a) shows 
the threshold curves of the gate shown in Fig. 7a). 
The bold line shows the main mode of thresh- 
old. The lighter lines show the modes including 
flux quantum ®, (=A/2e = 2.07 x 10715 Wb, 
where e is the electron charge and h/ the Plank’s 
constant) in the superconducting loop. 

Figures 7b) and 8b) respectively show a 
simple magnetic coupling structure and _ its 
threshold curve, while Figs. 7c) and 8c) show 
the current injection structure and its threshold 
curve for reference. As shown in the threshold 
curves of Fig. 8, the slope of the curve becomes 
steep when both magnetic coupling and current 
injection are made. The slope of the MVTL gate 
increases at a factor of about 1.6 so that a small- 
er signal can be used to switch the gate instead 
of only using magnetic coupling or current 
injection. 

The gate shown in Fig. 7a) cannot isolate 
the input signal from the output one because the 
input signal current flows directly from the 
output terminal. Therefore, an extra junction J3 
and resistor R; are added to the actual MVTL 


31 


S. Hasuo: Josephson Integrated Circuits H: High-Speed Digital Circuits 


a) Basic structure for the MVTL OR gate 
Ip 


b) Gate with a magnetic coupling only 


c) Gate with current injection 


Fig. 8—Threshold curves for coupling schemes shown in 
Fig. 7. g/p is chosen to be 3 for the three 
structures. 


OR gate as shown in Fig. 9. Input signals are 
added and fed through inductance L, and 
injected into the interferometer through isola- 
tion junction J3. 

Junctions J; and /2 switch due to the bias 
and input signal current being applied. while the 
operating point crosses the threshold value of 
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Fig. 9-MVTL OR gate. 


Table 1. OR gate design parameters 


Parameter Optimized value 
Lov 0.4mA 
q/p 3 
LIm/®, 1.0 
L 5.2 pH 
M 0.8 L 
1 i 3-5 L 
Rj 12 
Rq 1.8 Q 


®,: flux quantum (2.07 x 10 +15 wp) 


the interferometer. The bias current then flows 
through J3 and Rj to ground. J3 is switched to 
the voltage state and most of the bias current 
flows to load resistor Ry. After the gate switches 
to the voltage state, the input current flows 
through AR; to ground, thus isolating input and 
output signals. Reference 22 details the MVTL 
gate design, and is not included here. Table 1 
lists the optimized design parameters. 
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a) Photomicrograph 
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b) Experimentally obtained threshold curve 
Curves should be compared with those of Fig. 8a) 


Fig. 1O—MVTL OR gate. 
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Fig. 11—Power-delay relationship of MVTL OR gate with 
a minimum junction diameter of 1.2 pm? 


Figure 10 shows the OR gate and experi- 
mentally obtained threshold curve. This curve 
was obtained for a special gate without isolation 
junction /3 and isolation resistor Rj as shown in 
Fig. 7a), and should be compared with the 
theoretical curve of Fig. 8a). The actual gate 
has three Josephson junctions J;, J>, and J3 
with respective diameters of 2.5, 4, and 2.5 wm. 
The gate with the minimum junction diameter 
of 2.5 wm is 31 x 4lym?. The switching delay, 
measured with a 100-stage gate chain, operates 
at 4.2 ps/gate?*?. Operating speed was increased 
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Fig. 12—Relationship between fastest gate delay and 
minimum junction diameter of MVTL OR gate. 


by reducing the junction diameter. Five types of 
MVTL OR gates with different junction sizes 
have already been fabricated. The minimum 
junction diameter was changed from 4 yum to 
1.2 um. The fastest gate speed (1.5 ps/gate) was 
obtained using a gate with a minimum junction 
diameter of 1.2 ym*. Fhis speed was achieved 
under a power consumption condition of 
12 upW/gate. Figure 11 shows the power-delay 
relationship of this gate. It also shows that the 
3 ps/gate speed can be achieved even during low- 
power operation at 5 uW/gate. 
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Fig. 13—Single junction AND. 
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b) Its symbol 


Fig. 14—20OR-AND gate (unit cell). 
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Figure 12 shows the relationship between 
the fastest gate delay and minimum junction 
diameter for five MVTL OR gates with different 
minimum junction sizes?)’??)-2). Note that 
Josephson logic gates can attain a gate delay of 
less than 2 ps/gate without using submicron 
technology. This is an advantage of Josephson 
devices because submicron or even quarter- 
micron technology is essential for semiconductor 
devices in achieving speeds up to 10 ps/gate. 
Figure 12 suggests that a sub-ps/gate delay may 
be achieved by fabricating a gate with junctions 
0.7-u~m in minimum diameter. 


3.2 MVTL gate family*)-?*) 

An AND gate is constructed with single 
junction J, as shown in Fig. 13a). Critical 
current /7, of junction J, is designed so that J, 
can be switched at the summation of two input 
currents for total current exceeding critical 
current /y,. The operating margin for the input 
current is +33% {see Fig. 13b)}. When the 
critical current varies +20%, the operating margin 
becomes +14%. To prevent the operating margin 
from decreasing, this AND gate receives no bias 
current and cannot provide isolation. As a 
result, the AND gate is always used behind the 
OR gate, called the 20R-AND gate or unit cell, 
in the OR-AND combination shown in Fig. 14. 


FUJITSU Sci. Tech. J., 27, 1, (April 1991) 


S. Hasuo: Josephson Integrated Circuits I: High-Speed Digital Circuits 


ees a 


24m 


824m 


Fig. 15—Photomicrograph of 20R-AND gate. 


Fig. 16—Equivalent circuit of 2/3 majority gate. 


To prevent J, from being switched by leakage 
current from the next-stage gate, J, has resistor 
Rp and a junction J). The critical current of 
Jp equals that of J; of the OR gate. This is small 
enough to minimize current loss through Jp 
when switching to the voltage state. Figure 15 
shows a unit cell fabricated with a minimum 
junction diameter of 2.5 wm. The unit cell is 
82 um x 132 wm? in size with a gate delay of 
11.5 ps?*, The delay time for the gate with a 
minimum junction diameter of 4 um is 16 ps?*?. 

In an LSI logic circuit, the carry signal of a 
full adder can only be obtained by using a 2/3 
majority (MJ) gate. The 2/3 MJ gate in the 
MVTL gate family is basically the same as the 
unit cell, but has another OR gate connected at 
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a) Circuit diagram 


Bias 


In Out 


b) Symbol 


Out 0 / 1 \ 


Time 


c) Time sequence 


Fig. 17—Timed inverter (TI)??). 


junction J,. The 2/3 MJ gate is constructed by 
adding an OR gate (see Fig. 16). The operating 
speed of the 2/3 MJ gate was evaluated at 
21 ps?®) for a minimum junction diameter of 
4 um. 

The latching characteristics of the Josephson 
junction make it impossible to fabricate in- 
verters. The Josephson logic gate requires a 
timing signal for inversion, and the inverter 
gate must be constructed with a timing signal. 
This: type of gate is called a timed inverter (TI) 
(see Fig. 17)??”. The TI operates correctly when 
signal current is applied before the bias current 
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Table 2. MVTL gate family performance 


" JJ dia 


\(um) 4.0 25 2.0 1:5 1.2 
Gate 


OR 5.6 | 4.2 3.3 25 1.5 
+ 
20R-AND 
(unit cell) 16.0 11.5 
30R-AND 
(majority) 21.0 


a a 


Unit: ps/gate or ps/unit cell 


rises. 

Table 2 summarizes the delay times obtained 
for different feature sizes**?!)-?. The TI 
delay is not listed here because a timing signal 
is required and the gate delay depends on the 
signal timing. 


4. Key techniques for fabricating high-speed 
circuits 

Josephson devices operate more differently 
than semiconductor devices and require specific 
techniques to make full use of their performance 
potential. The fabrication of Nb/AlIO,/Nb 
junction detailed in this paper”? was a measure 
breakthrough. 

Another important technique is the high- 
speed gate with a sufficient operating margin for 
constructing high-speed circuits. This also re- 
quires a gate family consisting of several kinds of 
gates with different functions, such as the 
MVTL gate family described previously. The 
MVTL gate has a large operating margin and 
high sensitivity to input signal current, and can 
attain ultra-fast switching speed. The MVTL gate 
family facilitates the design of any kind of 
circuit because it enables a systematic circuit 
layout. 

Two important techniques needed for high- 
speed circuits in addition to the Nb/AlO,/Nb 
junction and the MVTL gate family are dual-rail 
logic and the 3-phase powering system. 


4.1 Dual-rail logic 

In semiconductor devices, output 0 is easily 
obtained for an input of 1. In contrast, the 
Josephson device outputs | when | is input 
because it has no inversion function. Thus, 
semiconductor circuits cannot simply be re- 
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Fig. 18—Circuit construction of C + AB. 


placed by Josephson devices. This requires 
special circuits and what is called dual-rail 
logic!” , 

True and complementary signals are always 
provided for the dual-rail logic circuit. For 
example, C+ AB is constructed with dual-rail 
logic (see Fig. 18). Only true signals A, B, and 
C are used on the upper plane of the circuit, 
while only complementary signals A, B, and C 
are used as input on the lower plane. Circuits for 
the upper and lower planes are set in a true and 
complementary relationship, that is, an OR gate 
is used on the upper plane instead of an AND 
gate on the lower plane and vice versa. When a 
complementary signal is needed in the true 
circuit, it is taken from the complementary 
circuit. Any circuit can be made without using an 
inverter. Even though dual-rail logic doubles the 
number of gates, it is usually used in Josephson 
LSI circuits due to the simple circuitry. 

A TI gate is sometimes used because a com- 
plementary signal is necessary for an input 
signal, even in dual-rail logic. We usually use TI 
gates at the leading edge of the bias current, and 
use 3-phase powering as described in the next 
section. The TI gates can be used three times in 
one clock cycle. The TI gates used in dual-rail 
logic circuits do not double the number of 
gates because only true signals are needed for 
output in each powering stage. Combining 
dual-rail logic and TI gates requires about 1.5 
times as many gates as required by semicon- 
ductor circuits. 
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c) Current waveforms from a regulator 


Fig. 19—Alternate current powering with single-phase 
power supply. 


4.2 Three-phase powering system 

Josephson logic gates operate in latching 
mode, with the gate switching to the voltage 
state only when an input signal is applied. 
Once switched, the gate does not return to 
the zero-voltage state, even after the input 
signal is turned off. Therefore the power supply 
driving Josephson logic gates must be turned off 
to reset the gate in each clock cycle. Alternate 
current power is used for this purpose?®)-%°) | 
Sinusoidal ac power is supplied from an external 
source and is reformed as trapezoidal ac power 
by using a regulator circuit. The regulator usual- 
ly consists of four Josephson junctions connect- 
ed in series. Figure 19 shows how the regulator 
usually consists of four Josephson junctions 
connected in series. Figure 19 shows how the 
regulator generates the clipped top waveform 
of the bias current. Logic operation is executed 
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Fig. 20—Alternate current powering system with 
single-phase power supply. 


in the flat part of the source power. 

Due to symmetrical current-voltage character- 
istics of Josephson junctions, positive and 
negative bias current can be used for logic 
operation. A latch circuit is used to retain 
information when bias current changes from 
positive to negative or vice versa?) 34) The 
information is stored in the form of persistent 
current in the superconducting loop without 
any bias current. 

Figure 20 shows the circuit construction 
with an ac power supply. The conventional 
combinational circuit (CC) is accompanied 
by a latch circuit. The information from the 
previous clock cycle is stored in the latch 
when ac bias current changes its polarity, 
and is transferred to the adjacent CC during 
the changing period to the next clock cycle. 
The latch consists of superconducing flip-flops 
that temporarily store the information. There 
are other problems in using ac power, such 
as CCs not being activated when power is 
in transition. To increase the clock frequency, 
we must reduce the transient time. This reduc- 
tion would result in a malfunction caused 
by punch-through?»)"3” , however. Another 
problem is posed by the polarity of bias current 
being reversed in each clock cycle so that 
the information stored in the latch cannot 
be directly used in the next CC, In addition, 
a latch consisting of magnetically coupled 
devices would increase the size of the circuit. 
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a) Circuit configuration with latch circuits 
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Fig. 21—Two-phase monopolar powering system. 
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Fig. 22—Logic operation with three-phase sinusoidal 
power supply. 


Different types of multi-phase monopolar 
powering have been proposed to resolved 
the problems encountered in ac or bipolar 
powering. Figure 21 shows 2-phase monopolar 
powering>”. The two regulated monopolar 
power supplies ¢, and ¢, are 180° out of 
phase. At the leading edge of @,, the latch in 
phase ¢, reads the output of the CC in phase 
@, and information is transferred to the CC 
in ¢,. Logic operation is executed under ¢, 
and terminated before $, begins to rise. After 
output is realized by the latch in ¢,, the CC 
in @, and the latch in ¢, enter the idle state 
when @, is off. This powering system has 
several advantages. Logic operation is continued 
alternately in ¢, and @,. The shaded areas 
in Fig. 21 indicate the active times when logic 
operation is executed. In this way, there is 
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Fig. 23 —-Three-phase monopolar powering with 
sinusoidal waves. 


no idle time so that the system can operate 
faster than with ac power. The latch can consist 
of current-injection gates because the polarity of 
bias current is not reversed and the latch can be 
made smaller. 

We have developed a 3-phase powering 
system. Three sinusoidal waveforms with dc 
offset are used, with each being 120° out 
of phase. Figure 22 shows the logic operation 
using this system. When logic operation is 
executed and completed in phase @,, data 
is transferred to the circuits driven by phase 
g@, and processed there. After ¢, is completed, 
data is sent to the circuits driven by $3. Data 
is retransferred from circuits in $3 to those 
in ¢,. This completes one clock cycle in 3-phase 
powering. This system requires no latch because 
processed data is transfeered as ¢,, $2, 93, 
¢,, ..., without wasting time. In three-phase 
powering, racing does not occur. This is a 
significant advantage over 2-phase powering. 
Figure 23 shows that the circuit configuration 
with 3-phase sinusoidal powering. The TI 
can also be used more frequently than in 2-phase 
powering. The input signal must be followed 
by bias current in the TI. This requirement 
is satisfied at every rise of the next clock cycle 
power. 

The 3-phase sinusoidal waveform has an 
important effect not possible with the single- 


phase or multi-phase trapezoidal waveform. 
The 3-phase sinusoidal waveform reduces 


deviation in the ground voltage, which is vital 
to the high-speed operation of Josephson 
circuit. Bias current flows through contacts 
and is terminated at the common superconduct- 
ing ground plane. If the total current through 
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the ground plane changes drastically over time, 
a time variation is caused in the voltage level 
of the ground plane, which, in turn, may cause 
a logic gate malfunction because the output 
voltage of the Josephson logic gate is only 
2.8mV. The clock frequency is on the order 
of 1 GHz, and the total supply current for 
the chip is about 1 A. The time variation of 
ground voltage V.; is given by L (d//dt), where 
L is stray inductance and / is supply current. 
For L of 50 pH, a typical value of flip-chip 
bonding, V,;, becomes 0.3 V. This is much 
larger than the signal voltage of 2.8 mV from 
the Josephson circuits. By using the 3-phase 
sinusoidal waveform, however, the total current 
to the ground plane does not change with time. 
As a result, the ground level is not affected 
by power supply changes. This is a significant 
advantage of the high-speed operation of 
Josephson logic circuits. Powering with a 
sinusoidal waveform results in a duty ratio of 
active time a little less than that with trapezoidal 
waveforms, although the duty ratio is only 
reduced by about ten percent, and is not critical 
to circuit operation. 


5. High-speed logic circuits 

Various high-speed Josephson LSIs have 
been recently developed. These include a 4-bit 
processor”, a 4chip 4bit computer™ , and 
1 K-bit memory”, all using niobium junctions. 

The MVTL gate family has been used 
to fabricate various logic circuits. These include 
an 8-bit shift register”, 16-bit arithmetic 
logic unit Lu. 4-bit microprocessor™, 
4-bit processor”, and 8-bit digital signal 
processor (DSP). The following sections 
describe the performance of these circuits. 


5.1 Shift register” 

We designed an 8-bit shift register with 
SHIFT, LOAD, HOLD, and CLEAR functions. 
Figure 24 shows a circuit diagram of the 1-bit 
shift register. The OR, 2OR-AND, and TI 
symbols are defined in Figs.9b), 14b), and 
17b). 

S:DS+L:-DL+H-Ty, denotes the output 
of the 1-bit shift register, and S, ZL and H denote 
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Fig. 24—One-bit shift register circuit. Input resistor Rin 
for the OR gate and TI and supply resistor R, 
for the OR gates are omitted. 


the control signals used for SHIFT, LOAD, and 
HOLD. DS and DL denote the data used for 
SHIFT and LOAD. Tg, denotes the output 
data for the 1-bit shift register from the 
preceding $3 phase power clock, and is used 
for HOLD. Such AND operations as S- DS, 
L-DL, and H.- Te. are executed using 2OR- 
AND gates powered by @, (see Fig. 24). 

In the 2OR-AND gates powered by ¢, 
and $3, one connecting line of each AND 
gate is not connected to the OR gate, but is 
connected through a 70 Q resistor to the power 
bus. These AND gates are used to amplify 
the current for a fan-out of 2. 

A circuit consisting of resistor R, and 
Josephson junction J, makes a l-shot pulse 
by switching J, when the data for LOAD 
DL rises. If the pulse width of DL is longer 
than one clock cycle, this circuit transfers 
the data for only one clock. 

This shift register outputs T9, > Tees and 
Co,. T and C denote true and complement. 
@, and @3 represent their power phases. Other 
power phase output can also be easily obtained 
by slightly modifying the circuit. 

Figure 25 shows the 8-bit shift register, 
which consists of the eight l-bit shift registers 
shown in Fig. 24. When | is applied to S, the con- 
tents of each 1-bit shift register are transferred 
to the next register to the right. The DS of each 
bit is given by output 73, of the preceding bit. 
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Fig. 26—Fabricated 8-bit shift register chip. 


The DS for bit 1 is given externally by DS,. 
When 1 is applied to L, LOAD is executed. 
In this operation, data from DL, to DLg is 
applied in all bits in parallel. When | is applied 
to H, HOLD is executed. The data for each bit 
is held from one clock to the next. When 0 is 
applied to all three control signals, the output 
of all bits becomes O after the CLEAR 
operation. 

The 8-bit shift register was fabricated using 
Nb/AIO,/Nb junctions, Mo resistors, and 
SiO, insulation, that is, the standard process 
described in another part of this paper). The 
minimum diameter of the junction is 2.5 um. 
The critical current density of 1700 A/cm? is 
about 20 percent less than the design value of 
2100 A/cm*, Figure 26 shows the fabricated 
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Fig. 27—Function tests of SHIFT, LOAD, HOLD, and 
CLEAR operations, 


chip. The circuit area is 1.1 x 2.1mm? and 
contains 112 gates. There is a total of 328 
Josephson junctions and 516 resistors. The total 
power consumption was 1.8 mW. 

Figure 27 shows the results of testing the 
shift register function using an 80-us clock. 
All functions operated correctly. SHIFT 
Operation was executed with clocks up to 
2.3 GHz (430 ps) as shown in Fig. 28. By apply- 
ing | to S and DS, continuously with a unit step 
pulse, the output of bit 1 was set to 1 and that 
of bit 8 was set to | after eight clock cycles. The 
operating margin of the power bus was only a 
point at 2.3 GHz for the SHIFT operation. For 
500-MHz clock operation, it was +8%. 

A pseudorandom bit sequence generator 
was also developed*”. The circuit is constructed 
with nine stages of the 1-bit shift register with 
its output being fed back to stage 5 through 
an exclusive-OR gate. This generator can 
generate a pseudorandom number with a 511-bit 
sequence. Correct operation was also confirmed 
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Fig. 28—Measured SHIFT operation with 2.3 GHz 
(430 ps) clock. 


up to 2.2 GHz. 


5.2 Arithmetic Logic Unit: ALU*” 

The ALU is a key processor component. 
The ALU designed with the MVTL gates 
performs 12 functions (see Table 3). Three-bit 
control signals Sy, S,;, and S,, and one carry 
signal Cj, from the previous stage are used to 
determine operation. By using different 
combinations of these four signals, the ALU 
performs eight arithmetic functions and four 


logic functions, 
A unique block layout was developed 


to achieve these functions using the MVTL 
2OR-AND unit cell. The unit cell executes 
logic operation (A + B)-(C+D), and the ALU 
diagram layout is designed based on this opera- 
tion. Figure 29 shows a 1-bit ALU consisting 
of 18 unit cells. Because dual-rail logic is used, 
complementary signals for both control and 
data are required. A, and B, are input signals, 
and F,, is the output data signal. 

Output terminals for the carry signal of 
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Fig. 29—One-bit ALU diagram. 


the 1-bit block shown in Fig. 29 are laid out 
for easy connection to the input terminals 
of the next block. This block can be used as 
a simple 1-bit slice ALU. When blocks are 
arranged serially, the input and output terminals 
are connected with the shortest possible wiring, 
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resulting in a multiple-bit ALU using the ripple 
carry. 

Figure 30 shows a 1-bit ALU composed 
of unit cells. The arrangement of cells and 
interconnecting lines corresponds to the 
diagram in Fig.-29. The carry signal flows from 
left to right. The minimum diameter of the 
junction is 2.5 um, The circuit area for the 
1-bit ALU, enclosed in dashed line in Fig. 30, is 
480 x 800 wm?. The area includes 18 unit 
cells, power buses for the OR gates, and the 
interconnecting lines. The power bus is 22 um 
wide, and provides stable power. A 2-layer 
superconductive metal was used for the inter- 
connecting lines. Line width and spacing are 
both 4 um. Contact holes are 2 x 2 um?. 

The multiple-bit ALU consists of 1-bit 
ALUs connected side by side. Figure 31 


480 nm 


Fig. 30—One-bit ALU. 


shows a fabricated chip whose circuit area is 
8.2 x 0.85 mm?. The sixteen 1-bit ALU blocks 
are connected serially. The total number of gates 
is 900. The critical current density (2 600 A/cm? ) 
of the Josephson junction is slightly higher than 
the optimum design value of 2 100 A/cm?. 

We confirmed correct circuit operation 
with a low-frequency clock (1 kHz), To measure 
the critical path delay, we set the ALU in adder 
mode. The critical path delay is the sum of 
carry propagation delays from bits 1 to 16 
when (000...01)+(111...11) is executed, 
and the shortest delay measured was 860 ps. 
The critical path of the ALU includes 83 stages 
of OR and AND gates, and the total length 
of superconducting lines between the unit cells 
is 13 mm. The propagation delay in the 13 mm 
lines was calculated as 95 ps. From these values, 
the average gate delay was estimated to be 
9.2 ps per gate. Thus, we _ succeeded in 
demonstrating sub-10 ps gate operation in an 
actual circuit with reasonable fan-in and fan-out 
for the first time. Since the total power 
consumed was 10.1 mW, the average power 
consumed per gate was 11.3 uW. 

5.3 Four-bit microprocessor™ 

We newly fabricated a 4-bit microprocessor 
in the world’s first application of Josephson 
devices to a microprocessor. This development 
confirms the ultra-fast operation of Josephson 
circuits at the LSI level. Chip functions similar 
to those of the Am 2901 Microprocessor made 
by Advanced Micro Devices Inc” were selected 
because this microprocessor is considered 
the standard 4-bit microprocessor slice. A GaAs 
version of the Am 2901 Microprocessor has also 
been developed*”, Consequently, the capabili- 
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Fig. 31 —Sixteen-bit ALU. 
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Fig, 32—Four-bit microprocessor diagram, 
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Fig. 33 —Four-bit microprocessor. 


ties of the Josephson microprocessor can now 
be compared with those of semiconductor 
microprocessors having the same functions, 
and the performance potential of Josephson 
circuits can be clarified. 

Figure 32 diagrams the Josephson micro- 
processor we designed. This microprocessor 
features a dual memory set used as 16-word 
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by 4bit 2-port RAM with a RAM shifter, 
8-function ALU, Q register, and several control- 
lers. This circuit is driven by 3-phase power ¢,, 
2, and $3. Dual-rail logic was used in the ALU 
and the controllers of the microprocessor, and 
complementary signals are made from the input 
signals with TIs powered by ¢,. Decoding is 
run in gates powered by @¢,, memory data 
reading powered by @¢,, and data modification 
and writing powered by ¢3. 

Both the minimum junction diameter 
and line width are 2.5 um. The interconnecting 
lines are 4 um wide. Figure 33 shows the chip. 
The basic gate is the MVTL, and there are 
a total of 1 841 gates. 

The critical path of the carry signal is trans- 
mitted from the least significant bit (LSB) 
to the most significant bit (MSB) in the ALU, 
then the sum signal is transferred from the ALU 
to RAM. Forty-one gates are required to switch 
sequentially along the path, with an inter- 
connecting line length of 15 mm. MVTL gates 
operate with a sub-10 ps gate delay in the actual 
circuits as previously described, and the propaga- 
tion delay in the interconnecting lines is about 
8 ps/mm. The critical path delay time may be 
roughly estimated about 0.5 ns, and a sinusoidal 
power duty ratio is about 1/2. Thus, the maxi- 
mum clock frequency was estimated at 1 GHz. 
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Fig. 34—Operation waveforms of the 4-bit 
microprocessor. 


Table 4. Four-bit microprocessor performance 


Maximum clock Power dissipation 


Device (MHz) (W) 
Si 30 1.4 
GaAs ie 2.2 


Josephson 770 0.005 


7 Data out 


RAM vot Carry out 


4-bit slice 


microprocessor Multiplier 


8-function ALU 
64-bit 2-port RAM 
4-bit Q register 


Register 


Power 
(clock) Instruction/data ROM 


256-word X 32-bit 


Sequencer 


lowe in 


Fig, 35—Four-bit processor diagram. 


The critical current density of the fabricated 
Josephson junction was 2300 A/cm?, which 
is slightly higher than the optimum designed 
value of 2100 A/cm?. The measured operating 
margin was +34% for the MVTL OR gate and 
+32% for the unit cell. All functions and source 
combinations were confirmed with an operating 
margin of +16% at a clock frequency up to 
100 MHz, which is the maximum clock fre- 
quency of the word pattern generator. 
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Fig. 36 —Four-bit processor, 


Operation along the critical path of the chip 
was tested using the high-speed pulse generator, 
with correct operation confirmed up _ to 
770 MHz as shown in Fig. 34. The gate power 
dissipation was 3.6 uW/gate, and the total 
power consumed by the chip was 5 mW. 
Josephson microprocessor operation using 
a clock frequency one order of magnitude 
faster than, and power consumption three 
orders of magnitude less than a semiconductor 
microprocessor were confirmed. Table 4 com- 
pares the performance capabilities of the Am 


2901 Microprocessor for three different materi- 
als?4)49):47) | 


5.4 Four-bit processor” 


The microprocessor was expanded into a 
more complete processor by adding a 4-bit 
multiplier, a 12-bit accumulator, 8 K-bit instruc- 
tion ROM, and a sequencer. Figure 35 diagrams 
the 4-bit processor. Figure 36 shows the 
fabricated chip. The minimum junction diameter 
was reduced from 2.5 um to 1.5 wm and, 
accordingly, the gates and memory cells were 
made smaller. Consequently, the Am 2901 
Microprocessor was reduced in size from 
5x5mm? to about 2x2mm?. Thus, the 
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Fig. 37—Eight-bit DSP diagram. 
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D register control 
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Fig. 38—DSP processing sequence, 


integration of all components, including 
3056 gates, on a chip 5x 5mm? was made 
possible. 

The ROM access time was 100 ps for the 
far-end ROM cell. The multiplication time 
was 200 ps for operation along the multiplier 
critical path. Correct 4-bit microprocessor 
operation was also confirmed along the critical 
path up to a clock frequency of 1.1 GHz. 
The maximum clock frequency increased 
due to the 1.5-u~m technology employed to 
reduce the microprocessor size. The power 
dissipation of the chip was 6.1 mW. Because 
the multiplier, microprocessor and ROM are 
operated by different phases of three pipelines, 
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the multiplier and ROM are fast enough to be 
operated at a clock frequency of 1.1 GHz. 
Thus, the operating speed of the chip is limited 
by the clock frequency of the microprocessor. 


5.5 Eight-bit Digital Signal Processor: DSP 

We have designed an 8-bit DSP to achieve 
high-speed system-level performance. The DSP 
includes a multiplier, ALU, instruction ROM, 
coefficient ROM, two data RAMs, a sequencer, 
and other control circuits. The DSP performs 
the same functions as commercially available 
DPSs and offers comparable performance 
capabilities. 

Figure 37 diagrams the DSP. The architec- 
ture is based on that of the Si DSP*®. Figure 38 
shows the 3-stage pipeline processing sequence. 
Both the 8 x 8 bit multiplier and 13-bit ALU 
are active in the third stage. The DSP thus 
completes a series of multiply-add operations 
and a multiply-add operation using previously 
added data once every machine cycle. With 
this design, the data processing rate using 
previous data is the same as the internal machine 
clock frequency. 

Data memory consists of two 16-word by 
8-bit RAMs, RAMA and RAMB. Each has 
its own addressing circuits. Program ROM 
(IROM) is 64 words by 24 bits. Each 24-bit 
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Fig, 39—24-bit instruction code format and immediate data format. 


instruction is divided into seven fields as shown 
in Fig. 39. The DSP contains a special 16-word 
by 8-bit memory area (CROM) for coefficient 
storage. 

The I/O port of this DSP consists of data 
registers and a controller. Output data is stored 
with an independent external clock. This port 
helps interface the high-clock-speed Josephson 
DSP with low-clock-speed semiconductor sys- 
tems. 

Such Josephson devices, as the 69 x 52 yum? 
unit cell for the logic gate, 26 x 16 um? ROM 
cell, and 133x48 um? data RAM are built 
into this DSP. The MVTL OR gate in the unit 
cell occupies 26 x 21 um?. Each data RAM 
cell stores data in a superconducting persistent 
current, and has a sense amplifier to increase 
the operating margin. The minimum junction 
diameter is 1.5 wm and the minimum line width 
is 2pm. The 2-layer superconducting inter- 
connections are 3 ym wide with 2 wm spacing. 

Figure 40 shows the DSP chip. To reduce 
the level of complexity, both the multiplier 
and ALU use the ripple carry method. Table 5 
lists the chip specifications. There are 6300 
gates and the chip is 5 x 5 mm? in size. 

The operating speed of each circuit in the 
DSP was measured, The carry propagation 
delay in the multiplier was 240 ps. Forty gates 
must switch sequentially along a 3.6 mm path. 
Because the propagation delay is 8 ps/mm, 
the average gate delay is estimated at 
5.3 ps/gate. For the ALU, the delay was 410 ps. 
The critical path in the ALU is the carry pro- 
pagation route in adder mode, and 44 gates 
must switch sequentially along a 4.2-mm path. 
The average delay is estimated at 7 ps/gate. 
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Fig. 40—Eight-bit DSP. 


Table 5. DSP chip specification 
— 
I 


tem Specification 
Gates 6 300 
Josephson junctions 23 000 
Minimum-junction size 1.5 um 


Instruction 
ROM Coefficient 
Data RAM 
Multiplier 
ALU 


Chip size 


64-word x 24-bit 
16-word x 8-bit 
16-word x 8-bit x 2 
8-bit x 8-bit 

13-bit, 16 functions 


5.0mm x 5.0mm 


The access time of the far-end IROM was 

200 ps, and that of the RAM cells was 130 ps. 
The internal machine cycle is limited by 

the stage 3 pipeline. Both multiplication and 
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addition are completed in this stage. The total 
delay of both multiplier and ALU is 650 ps. 
The maximum clock frequency is estimated 
at | GHz with sufficient margin for an inter- 
connection propagation delay between the com- 
ponent so that a nonparallel processing speed 
of 1 GOPS (Giga-Operation Per Second) can be 
attained. This is 100 times faster than the con- 
ventional CMOS DSP, with about one tenth the 
power consumption®? 
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Fig. 41—Conventional memory cell. 
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6. Memory circuits” 


It was once considered that high-speed 
Josephson cache memory capacity up to the 
4 K-bit level was too difficult to achieve”). No 
one actually built a memory circuit with sub-ns 
access time until 1988. We discarded conven- 
tional memory in an attempt to develop a 
completely new memory circuit with a new 
memory cell, decoder, and driver circuit. 

Figure 41 shows the circuit diagram of 
a conventional memory cell, with which | K-bit 
memory circuits were fabricated. Access time 
as fast as 570 ps has been reported*!). The 
development of higher-density memory was 
complicated due to the difficulty in reducing 
the cell size used in conventional memory 
design. Other problems were posed by the small 
fan-in and fan-out of Josephson gates and 
their lack of sufficient drivability. To solve 
these problems, a new memory circuit with 
a 4K-word by 11-bit configuration was 
developed” : 


6.1 Memory cell 

We developed a_ capacitively coupled 
memory cell in which information is stored 
as a superconducting persistent current in 
an rf SQUID. The rf SQUID is a superconduct- 


Internal magnetic flux 


Offset 
current 


Wi 


&R 


= Address current 


b) Memory operation on internal flux vs external 
flux curve (W, denotes write 1,andW, & R 
denotes write 0 and read) 


Fig. 42—Capacitively coupled memory cell. 
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ing loop with one Josephson junction. The cell 
consists of one rf SQUID, a sense junction, and 
two capacitors as shown in Fig. 42a). Informa- 
tion is read by the sense junction connected 
to the rf SQUID through the capacitors. One 
memory cell is selected by two magnetically 
coupled control lines. 

The following describes memory cell 
operation and Fig. 42b) shows the relationship 
between internal flux ©; and the external 
flux caused by the x- and y-address currents. 
The regions with a positive slope (bold line) 
are stable, and negative-slope regions (dashed 
line) are unstable. Thus, two stable states, 
denoted by quantum states O and 1, exist for 
the properly shifted point of the address current 
with an offset current. The states correspond 
to binary O and 1. The address current is positive 
for writing 1 and negative for writing O. For 
reading, negative-polarity address current is 
applied and the sense junction is biased below 
the critical current. When | is stored in the 
rf SQUID, it goes from quantum state 1 to 
quantum state 0. A small pulse appears across 
the junction of the rf SQUID when it changes 
the states. The pulse voltage multiplied by the 
pulse width almost equals the value of flux 
quantum @®,). For example, a pulse voltage 
of 200 uvV with a 10-ps duration appears across 
the junction. This pulse is transferred to the 
sense junction, causing sense junction voltage 
transition. Although the read operation is 
destructive, rewriting is done immediately 
after reading. 


6.2 Decoder circuit 

There are two ways to construct a decoder: 
using AND gates or using OR gates. The 
advantages of AND gates are that timing pulses 
are not needed. But more gates should be used 
than when using OR gates because the multiple- 
input AND gate is not practical for Josephson 
AND gates. The disadvantages of OR gates 
are that timing pulses are needed and the circuit 
becomes more complex. We have successfully 
constructed a decoder with AND gates. To 
reduce the number of logic stages needed 
for decoding, we developed an AND gate with 
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Fig. 43 —Equivalent circuit of AND gate for decoder. 
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Fig. 44—AND gate. 


a fan-out of 4. A 6-bit to 64-bit decoder that 
selects 64 address lines with 6-bit address signals 
can be constructed with three AND gate stages 
with a fan-out of 4. 

Figure 43 shows the equivalent circuit of 
the AND gate. Two 4-junction logic (4JL) 
gates are used to isolate the input and output 
of the previous gate. One junction is used 
as an AND gate that operates like a member 
of the MVTL gate family. The output current 
is amplified by the other single junction, which 
was biased independently of the 4JL gates, 
and the fan-out was increased from 2 to 4. 

Figure 44. shows the fabricated gate, 
83 x 78 um? in size, with a 4m line width. 
We designed a 6-bit to 64-bit decoder with 
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this AND gate and confirmed its correct opera- 
tion. The decoding time was measured at 90 ps, 
which is the fastest decoding time yet recorded. 


6.3 Driver circuit 

Since memory cells were arranged in a 
64-by-64 configuration, each driver gate had 
to drive 64 cells connected in series. The drive 
line can be considered a transmission line, 
provided the propagation delay time for one 
memory cell is less than the rise time of the 
drive current. This condition is usually satis- 
fied because the propagation delay time is less 
than lps and the rise time exceeds 10 ps. 
For matched transmission line, drive current 
Iq is given by Vo/Zo, where Vo is an output 
voltage of the driver gate and Zp is the charac- 
teristic impedance of the drive line. Since Zo 
was 18 Q and Jg is 0.6 mA in our cell design, 
the output voltage of the driver should be 
10.8 mV, which is about 4 times the gap voltage 
of the Nb/AIO, /Nb junction. 

Since simple series-connected junctions do 
not switch simultaneously, it was considered 
too difficult to obtain an output voltage larger 
than the gap voltage of one junction, that is, 
2.8mV for the Nb/AlO,/Nb junction. We 
have developed a gate that generates a voltage 
higher than the gap voltage. Figure 45 shows 
the circuit configuration. The gate consists 
of two branches, each consisting of four 
junctions and one resistor connected in series. 
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Fig. 45—Equivalent circuit of driver gate. A cell drive 
line is also shown. The critical current of all 
junctions is 4.2 mA. The resistance for each 
of the two resistors is 1.5 Q. 
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The bias current is supplied to the gate midway 
between the two branches. The critical current 
of junctions and the resistance are designed 
to be equal for both branches. The next chapter 
describes the principles of the gate operation. 

This driver circuit was fabricated to include 
64 memory cells. The measured rise time at 
the far end of the drive line was 60 ps. 


6.4 Memory operation 

Figure 46 shows the 4K-bit memory 
designed and fabricated with the circuits 
described above. The junction 
diameter is 2.5 wm and the minimum line width 
4 um. Each memory cell is 83 x 83 um? in size. 
A rather large cell was used to verify memory 
circuit feasibility without using sophisticated 
process technology. The chip is 7.7 x 7.7 mm? 
in size and includes 14468 junctions. The 
measured access time was 590 ps, and the chip 
power dissipation was 19 mW. 


minimum 


7. Interfacing 

Josephson logic circuits operate at a clock 
frequency above | GHz and their logic swing 
is less than 3 mV. Two factors dictate the need 
for interface circuits between the Josephson 
and semiconductor circuits. One factor is 


1mm 


Fig. 46—4 K-bit memory circuit. 
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Fig, 47—Serial-parallel converter operation principle. 
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converted into 8-bit parallel signals with 
a clock frequency of 125 MHz. 
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amplifying the output voltage from the 3 mV 
of the Josephson circuit to the 1 V needed 
to drive semiconductor circuits at room temper- 
ature. The other factor is converting the clock 


frequency from 1 GHz to 100 MHz. Simply 
reducing the clock frequency will cause 


information to be lost in a Josephson system. 
Consequently, we decided to convert the serial 
output signal from the Josephson system into 
a low-frequency parallel output called a serial- 


parallel converter. Figure 47 illustrates the 
conversion. 
The following sections describe the two 


interface circuits. 
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Fig. 48—Voltage waveforms across junctions driven by sinusoidal current, 
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7.1 High-voltage driver®” 

The output of a Josephson logic gate consist- 
ing of Nb/AlO,/Nb junctions is only 2.8 mV, 
and is determined by the gap voltage of the 
junction. It would seem easy to obtain a voltage 
higher than the gap voltage with series- 
connected junctions, but this is true only for 
series-connected junctions without a load 
resistor. Even in such a case, the scattering of 
critical currents should be small. When a load 
resistor is connected to series-connected junc- 
tions, the junctions do not switch simultaneous- 
ly. Figure 48a) shows the voltage output across 
four series-connected junctions with a load 
resistor. A voltage step appears for each junction 
switching to the voltage state. When one of 
the four junctions switches to the voltage state, 
current flowing through a series-connected 
junction decreases because some bias current 
flows to the load resistor. Thus, bias current 
should be increased to switch the next junction 
to the voltage state. After switching, the current 
for the series-connected junction decreases 
again. Consequently, steps appear in the voltage 
across the junctions as shown in Fig. 48a). 
Thus, the simple series-connected junctions 
cannot be applied to an interface circuit for 
driving semiconductor circuits. 

This problem was solved by connecting 
the series-connected junctions in _ parallel. 
Figure 48b) shows the voltage across the junc- 
tions. Four series-connected junctions are 
connected in parallel, and load resistors are 
connected to the junctions. No step occurs 
and four times the gap voltage V, of the junc- 
tion is obtained. The series-parallel connected 
junctions switch simultaneously because the 
bias current flows equally to both branches, 
and after the bias current increases, the junc- 
tion with the smallest critical current, for 
example, in the left branch, switches to the 
voltage state, then some current flowing through 
the left branch is transferred to the right branch, 
causing an overdrive current for junctions 
in the right branches. Consequently, all junc- 
tions switch almost simultaneously to the 
voltage state. The bias current is transferred 
again to the left branch, and the remaining 
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Fig. 49—Equivalent circuit for the high-voltage gate 
interfacing Josephson junctions and 
semiconductors, 


junctions switch to the voltage state. After 
all junctions switch to the voltage state, bias 
current is transferred to the load resistors. 
This switching is done within a few tens of ps. 
The series-connected junctions can be applied 
to a gate, which should have an output voltage 
higher than the junction gap voltage. 

We designed a high-voltage gate using series- 
parallel junctions as shown in Fig. 49. The only 
difference between this structure and_ that 
shown in Fig. 48b) is the location of the resistor. 
Junctions J,;, to J;, serve as isolation junctions. 
Resistor R, functions both as branch and isola- 
tion resistors. After the isolation junctions 
switch, input current flows to resistor R,, to 
isolate both input and output. The circuit is 
designed so that the critical current in all junc- 
tions J,, to Jy, and Jz; to Jz, is equal. 
Resistors R, and R, are usually equal, resulting 
in a large bias margin. The current gain (or 
gradient of the threshold curve) is set to 1. 
Note that the voltage gain is 4. 

We applied this high-voltage gate with four 
junctions in each branch to supply 4 V, output 
voltage, to the driver circuit of the previously 
mentioned 4 K-bit memory. We also designed 
a high-voltage gate with 52 junctions in each 
branch to build a Josephson driver gate, and 
obtained output voltage of 150 mV. Because 
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this voltage is insufficient for driving a semi- 
conductor circuit at room temperature, the 
output voltage of the driver gate was applied 
to a high electron mobility transistor (HEMT) 
immersed in liquid helium as shown in Fig. 50. 
The HEMT had a gate length of 1 wm and 
a gate width of 50mm, and operated in 
depletion mode. Transconductance g,, increased 
from 200 mS/mm to 300 mS/mm when cooled 
from 300K to 4.2 K. Although the threshold 
voltage shifted about 0.4 V, the HEMT main- 
tained its depletion characteristics. 

Figure 51 shows the interface circuit in 
which the Josephson driver gate and HEMT 
driver are combined. The upper trace is the 
bias current of the Josephson driver and the 
middle trace is the input current. Both are 
negative. The lower trace is the output voltage 
of 0.9V from the HEMT. The HEMT was 
biased with dc voltage of 4 .V. When input 
current was applied to the driver gate, a positive 
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Fig. 50—Interface circuit for converting 3 mV output to 
1 V. It consists of Josephson and HEMT drivers. 
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Fig. 51—Interface circuit operation. 
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output voltage was obtained, then inverted 
by the HEMT. An output voltage swing of 
0.9V is sufficient to drive semiconductor 
circuits. By estimating the switching speed 
of this interface circuit through computer 
simulation, the rise time of the output voltage 
from the HEMT was confirmed at about 100 ps. 
This means that the high-voltage gate is suitable 
for use in an interface circuit. 


7.2 Serial-parallel converter>)) 

We designed an 8-bit serial-parallel converter 
consisting of a 1/8-prescaler that generates 
a pulse every eight clocks, and an 8-bit shift 
register. 

The 1/2-prescaler generates a pulse when 
the number of input pulses is even, that is, 
it outputs one pulse for every two input pulses. 
Connecting three 1/2-prescalers in cascade 
forms a 1/8-prescaler. 

The shift register is almost identical to the 
one previously described. Figure 52 shows 
the 8-bit  serial-parallel converter. Figure 53 
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S,: Output from the 1-bit shift register in each bit. 
P,: Parallel output available only when timing signal is applied. 
Zn: Output from 1/2 prescaler. 


Fig. 52—Eight-bit serial parallel converter diagram. 
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Fig, 53 —Serial parallel converter. 
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shows the fabricated converter, which is 
5.1x1.5 mm? in size, and which contains 
92 MVTL_ gates. Correct operation was 
confirmed at low frequency, and the circuit 
was tested at high frequency. Figure 54 shows 
the operation waveforms at 1.3 GHz. The 
upper trace is the bias current waveform at 
phase $3. The second is a timing pulse Z; 
obtained from the 1/8-prescaler. The middle 
eight traces, S, (m = 1-8), are output voltages 
from the shift register in each stage. Because the 
timing pulse from the prescaler was used as an 
input signal to the shift register, input is applied 
every eight clocks. The input signal was shifted 
from the lower to higher stages. The lower eight 
traces, P, (n = 1-8), are parallel output signals 
from the shift register in response to a timing 
signal. The timing pulse was supplied from the 
prescaler. Thus, the output (100000001000 ...) 
from the 1.3 GHz clock circuit was converted 
to an 8-bit parallel signal with 160 MHz opera- 
tion. Note the test circuit should include a cir- 
cuit prolonging the 8-bit parallel output pulses 
for the duration of reduced clock frequency 
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Fig. 54—Waveforms measured at 1.3 GHz. 
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(about 6 ns in this case). Even so, a prolongation 
circuit can certainly be built with Josephson 
gates. 


8. Progress toward the Josephson computer 

The introduction of the Nb/AlO,/Nb 
junction completely changed the nature of 
Josephson integrated circuits. The stability, 
uniformity, reliability, and reproducibility of 
these junctions finally made it possible to 
create a variety of high-speed Josephson 
integrated circuits. The next step is to install 
these circuits in a cryogenic refrigerator to 
demonstrate their high-speed performance at 
the system level. 

So, what lies ahead for Josephson integrated 
circuits? 


8.1 Josephson computer 

We know that it is possible to fabricate all 
components needed for Josephson computer - 
logic, memory, and interface circuits. Although 
their level of integration hardly matches that of 
semiconductor circuits Josephson integrated 
circuits operate much faster than semiconduc- 
tors. This is because it is difficult to fabricate 
large-scale memory, such as | M-bit or 4 M-bit, 
with Josephson junctions, and thus the memory 
hierarchy of the Josephson computer appears 
similar to the 3-stage structure shown in Fig. 55. 
The Josephson memory will operate at 1 GHz, 
which is the same clock frequency as for logic 
circuits. Such semiconductor memory as 
256 K-bit SRAM and 1 M-bit DRAM is used 
to store the information needed for calculation 
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Fig. 55—Josephson computer diagram, 
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Fig. 56 Small-scale Josephson computer. 
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by the Josephson computer. 

To develop a prototype Josephson computer, 
components must be installed in a cryogenic 
refrigerator, which looks very like a conven- 
tional semiconductor workstation, as shown 
in Fig.56, but which requires a compressor 
to liquefy helium gas through cooling. The 
compressor can also be set up in a separate 
room, if necessary. 

The first Josephson computer will not 
be as large is a semiconductor computer, but 
will be much faster. The smaller scale is not 
caused by technological problems, but is due 
to the relatively small amount of time and 
money expended on Josephson computer 
development projects around the world. After 
the first Josephson computer is introduced, 
the scale of this type of computer will certainly 
increase. 

The most important point to consider is 
what type of system should be installed in 
the first Josephson computer, for determining 
the practicality of the Josephson computer. 
If such a computer is only two or three times 
faster than semiconductor computers, it will 
be discarded. It must be at least ten times 
faster than the fastest semiconductor computer 
to ensure a future. 

One possibility is to make such a computer 
a special-purpose one for SQUID. signal 
processing, for example. The SQUID is used 
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to measure an extremely faint magnetic field. 
Magnetic field from the human body can be 
detected with the SQUID for obtaining magnetic 
cardiograms. To determine magnetic field 
distribution in the body, a large number of 
channels of SQUID are required, because the 
magnetic fields in many places should be 
measured simultaneously. For a 1 000-channel 
system, several thousands of cables would be 
needed to connect the chips in the cryogenic 
container and the electronic circuits at room 
temperature. As described in the following, 
a SQUID system can be integrated with a feed- 
back circuit on a single chip*”)»5) . By using 
such a chip, information that is easy to process 
using the Josephson computer can be obtained 
in digital form from the SQUID, and thus the 
number of cables can be remarkably reduced. 


8.2 Hybrid computer 

The use of  high-Tc superconducting 
materials in interconnecting wiring between 
circuit boards would enable development 
of a hybrid computer system consisting of 
Josephson devices and cooled CMOS (see 
Fig. 57). In such a computer system, large-scale 
signal processing would be done by the CMOS 
computer, while high-speed processing would 
be done by the Josephson computer. Both 
computers would be connected with high-Tc 
superconducting transmission lines. One way 
to promote the actual development of the 
Josephson computer may be to install Josephson 
devices as components of large-scale computer. 

The Josephson previously 
described is also a kind of hybrid computer 


computer 
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because large-scale memory operates at room 
temperature. In such a computer system, semi- 
conductor circuits must be installed very close 
to the Josephson circuits. This is important 
for signal transmission between both types 
of circuits. It is already possible to make a 
cryogenic container in which signal transmission 
is possible within 5 cm between room tempera- 
ture and the temperature of liquid helium™ 


9. Conclusion 

This paper has described our recent progress 
in Josephson integrated circuit development. 
Rapid progress has been made ever since 
niobium replaced lead alloys as the junction 
material. It is now possible to fabricate such 
circuits as processors, memory, and interfaces 
between Josephson and semiconductor circuits. 

The early 1980s were troubling times for 


engineers working on _ fabricated lead-alloy 
circuits that seldom worked as intended. More- 
over, such difficulties as a low fabrication yield, 
the large scattering of critical currents for junc- 
tions, unstable junction characteristics, and mal- 
functions due to flax trapping and punch- 
through, were very annoying. These difficulties 
were considered fatal to attempts being made to 
develop a Josephson computer. Although we 
introduced the niobium junction in 1984, it was 
difficult to obtain a good junction during the 
early stages of development. On the other hand, 
after successfully developing an effective fabrica- 
tion process, the junctions were impressively 
stable, reliable, reproducible, and controllable. 
Thus, most problems were solved simultaneously 
and great promise emerged from great difficulty. 

Part] of this paper details the use of 
niobium junctions. Niobium is possibly the 
best material for Josephson junctions, much 
as silicon is the best material for transistors 
because both are single-atom materials. Because 
compound or alloy materials are difficult 
to use with electron devices, the pairing of 
niobium and aluminum is more effective than 
could have possibly been expected. 

We developed the MVTL gate and 3-phase 
powering for high-speed circuits. The MVTL 
offers many excellent features, the most im- 
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portant being high speed and a large operating 
margin. The 1.5 ps/gate is the world’s fastest 
gate delay. We now plan to develop a subps/gate 
with MVTL. Three-phase powering is ideally 
suited to Josephson junctions operation. Three- 
phase powering makes it possible to ensure 
a smooth stream of data, which is vital for ultra- 
fast circuit operation. 

Without the key techniques described 
in this paper, it would not be possible to create 
Josephson junction high-speed circuts. The next 
step is to install chips in a cryogenic refrigerator 
to demonstrate their high performance capabili- 
ties at the system level. 

To build a Josephson computer, new 
techniques for packaging and refrigeration must 
be developed. Packaging technique is very 
important in terms of building an ultra-fast 
computer because it determines system speed. 
Although little is known about multi-chip 
operation, the development of a high-speed 
multi-chip packaging is essential. Refrigeration 
is also important for the Josephson computer. 
For this reason, a closed-cycle refrigeration 
system that is quiet, small and highly efficient 
must also be developed. We are now developing 
these techniques toward the realization of the 
world’s first Josephson computer. 
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Superconducting Quantum Interference Devices (SQUIDs) can measure magnetic fields as 
low as femto tesla and have been used for biomagnetism, resource surveying, and physical 
or geophysical measurement. Unlike the conventional SQUID, the single-chip SQUID is 
the first device that intergrates both a SQUID sensor and a feedback circuit on the same 
chip. This reduces the number of external cables. Also, the output can be processed with 
Josephson digital circuits in the cryogenic environment. These advantages open up the 
possibility of constructing a multichannel system with an array of more than 100 SQUIDs, 


which is required to measure a magnetic field map in detail. 


1. Introduction 

The superconducting quantum interference 
device is a superconducting ring that includes 
Josephson junctions. It can be used as a mag- 
netic sensor with high sensitivity. From early on, 
two types of SQUIDs (rf SQUID and de SQUID) 
has been used’). The rf SQUID is a super- 
conducting ring that has an inductance and 
a Josephson junction. It is coupled to an LC 
resonator and is fed with an rf current. The dc 
SQUID consists of a superconducting inductance 
and two Josephson junctions. It is biased with 
direct current. Recently the dc SQUID has 
been more commonly used for practical appli- 
cations, because of its higher sensitivity. 

The SQUID can measure magnetic fields 
as low as femto tesla (fT). Other magnetic sen- 
sors, such as the Hall effect device and flux gate, 
cannot measure magnetic fields at less than 
100 pico tesla (pT). Thus, the SQUID is an indis- 
pensable device for measuring magnetic fields in 
the range between 100 pT and 1 fT. It has been 
widely used for biomagnetism?)” , suscepto- 
meter, and geophysical measurement”. Other 
applications *®? are resource surveying, non- 
destructive measurement of magnetic impurity 
in products, gravity wave detection, and mono- 
pole detection. 

Rapid progress has occurred in the area of 
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biomagnetism®’” Many researchers have meas- 
ured magnetic fields from the brain, heart, and 
other tissues. The construction of a SQUID 
sensor array which measures the magnetic field 
map more quickly would be highly desirable. 
The SQUID array is called a multichannel 
SQUID. 

In the continuing attempts to construct 
a multichannel system, the single-chip SQUID®? 
is a promising device. It enables us to construct 
more than 100 channels. This paper reviews the 
principle of operation, design and performance 
of the single-chip SQUID. 


2. Target 
2.1 Multichannel requirements 

For biomagnetic applications, a multi- 
channel SQUID system is desirable for measur- 
ing the magnetic field map. Recently many 
efforts have been made to increase the number 
of channels, i.e. the number of sensors in the 
array, as shown in Fig. 1. BTI developed 5-chan- 
nel (1984), 7-channel (1987), and 37-channel 
(1989) systems. Helsinki University of Technol- 
ogy has made 7-channel (1987) and 24-channel 
(1989) systems. Siemens made a 37-channel sys- 
tem in 1989. Nevertheless, a still larger number 
of channels are said to be necessary for practical 
application. Extrapolations were discussed by 
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Fig. 1—The evolution of the number of channels”, 


Himildinen®) and other participants at the 7th 
International Conference on Biomagnetism as 
shown in Fig. 1. Many more projects on a multi- 
channel SQUID have been undertaken by other 
companies and research organizations. 

It is not yet known what would be the 
sufficient number of channels. As the magnetic 
field map is used to calculate current sources in 
tissues, further research on the inverse prob- 
lem?)>”) might give an answer concerning the 
sufficient number of channels required. The 
following discussion is only a simple estimation 
that we are able to make at this point in time. 
Let us assume that the brain is a sphere with the 
radius of 10cm, and its upper surface area is 
about 600 cm?. In order to cover this whole 
area with pickup coils at each 2 cm mesh point, 
the number of channels required would be 150. 
From this simple estimation, we imagine that 
100 channels or even 1000 channels might be 
required for future applications. 

As the number of channels increases, the 
number of cables required to connect the room 
temperature electronics and the SQUID sensors 
in the cryogenic environment (usually liquid 
helium) increases too. This causes larger heat 
flow and larger crosstalk between cables. The 
former causes a higher level of liquid helium 
consumption. Let us take an example, 1 W 
heat flowing from 300K to 4.2K through 
copper lines 1 m long and with 6.6 mm? cross- 
sectional area’. This would be equivalent to 
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Fig. 2—Block diagram of the SQUID. 


210 thin copper cables of 0.2 mm in diameter. 
The 1 W heat flow consumes 1.4 litres liquid 
helium in an hour. 

When using a closed-cycle refrigerator to 
supply liquid helium for the SQUID system, 
a larger heat flow requires greater refrigeration 
power. For example, the cooling of 1 W requires 
more than | kW refrigeration power. 


2.2 Fundamental operation of SQUID 

A SQUID sensor is operated in a feedback 
loop” as shown in Fig. 2. The SQUID sensor 
receives an input magnetic flux and produces 
an output. The feedback circuit receives this 
output, and returns the feedback flux to the 
SQUID sensor. The feedback flux is given with 
the polarity changed in order to cancel the 
unknown magnetic flux. The feedback loop thus 
maintains the sum of the unknown flux and the 
feedback flux at zero. We can ascertain the 
unknown flux by measuring the feedback flux. 
This null method enables us to measure the 
unkown magnetic flux without being affected 
by nonlinearity of the sensor, or additional 
amplifier drift. 


2.3 Analog SQUID 

The analog SQUID means a conventional dc 
or rf SQUID, where the output is an analog 
voltage. The analog feedback circuit receives 
the output, and gives a feedback flux to the 
SQUID sensor. 

2.3.1 Parallel method 

Until now, all practical multichannel systems 
have been constructed by arranging many analog 
SQUIDs in parallel as shown in Fig. 3a). With 
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this parallel method, each sensor requires a feed- 
back circuit?! which means a large amount of 
electronics operating at room temperature. Each 
dc SQUID requires cables for transmitting bias, 
output, and modulation or feedback. For these 
signals, each requires three pairs of cables. Each 
pair consists of one signal cable and one ground 
cable twisted together. So, for an n-channel 
SQUID system in the parallel method, 6n cables 
are required. If m is 100, the number of cables 
would be 600. On the basis of the previous 
estimation of heat flow, 600 cables would cause 
about 3 W heat flow. This indicates that if we 
wanted to construct a multichannel system with 
more than 100 channels in the parallel method, 
heat flow might be a serious problem. 

2.3.2 Frequency division multiplex 

To overcome this problem, the frequency 
division multiplex method! has been proposed 
and tested as shown in Fig. 3b). In this method, 
the dc SQUIDs are modulated at different 
frequencies to each other. The modulated 
output voltages are transmitted through a single 
output cable. They are demodulated by the 
electronics operating at room temperature. This 
method reduces the number of output cables for 
multichannel SQUID systems based on the con- 
ventional dc SQUID technology. In this case, the 
maximum number of channels is limited by the 
amount of crosstalk caused by nonlinearity of 
the SQUID. 

2.3.3 Time division multiplex 

The time division multiplex method!” has 
also been proposed, as shown in Fig. 3c). In this 
method, many pickup coils and one SQUID 
are used. The superconducting multiplexer 
selects one of the pickup coils and connects it to 
the SQUID sensor. 

In this method, the multiplexer is required 
to maintain the superconducting state in the 
selected connection line and to break the 
superconducting state for those lines not in use. 
One possible candidate device for this multiplex- 
ing operation is a superconducting thin film 
which is heated by the light that is conducted by 
an optical fiber. Another candidate is a super- 
conducting three-terminal device, which can 
select a superconducting or normal state by 
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changing the gate voltage. This method requires 
further study of these multiplexing devices. 


2.4 Digital SQUID 
Another approach is the digital SQUID®  , 

which is described in this paper. 

2.4.1 Definition of digital SQUID 

The digital SQUID is also operated in a feed- 
back loop as shown in Fig. 2. But, unlike the 
analog SQUID, the output of the SQUID sensor 
is a pulse sequence. The digital feedback circuit 
receives this output pulse and gives a feedback 
flux to the SQUID sensor. This is called a digital 
feedback loop. The sensor with pulse output 
can be called a digital SQUID sensor. 

2.4.2 Advantages of digital SQUID 

The digital feedback loop has the following 
two advantages. 

1) The signal-to-noise ratio of the output 
becomes high 
The output cable connecting the SQUID 

chip in the cryogenic environment and the 

electronics at room temperature is typically a 

few meters long. In the analog feedback loop, 

the output voltage to be resolved is of the order 
of nV/Hz'? and therefore very sensitive to 
environmental noise. In the digital feedback 
loop, the output pulse amplitude is typically 

mV. Besides, it is only necessary to ascertain 

whether or not there is a pulse. 

2) The output pulse can be processed by 
Josephson circuits in the cryogenic environ- 
ment 
First, this enables us to integrate the digital 

feedback circuit into the SQUID chip, thus 

creating a single-chip SQUID. In contrast to this, 
the analog feedback circuit requires a lock-in- 
amplifier which is not easy to integrate into the 
cryogenic chip because of the restricted per- 
formance of the superconducting analog circuit. 

As the single-chip SQUID has a fully inte- 
grated feedback circuit, a cable for an external 
feedback loop is not necessary. Each single-chip 

SQUID requires one cable for bias and one cable 

for output. Due to the high signal-to-noise ratio 

of the output pulse, a common ground can be 
used. If we construct a system with n channels, 
as shown in Fig. 4a), the number of cables is 


62 


(ww) Bias 


~ Output 


aan 


Low temperature 
SQUID | 
SQUID 


SQUID 


Low temperature 


Room temperature 


a) Parallel method 


Demultiplexer 


Josephson multiplexer 


Room temperature 


b) Time division multiplex 
Fig. 4—Multichannel system with the single-chip SQUID. 


2n. This is 1/3 of the number required by the 
parallel method with analog SQUID. 

Secondly, as the output is a pulse, it 
becomes possible to process this by Josephson 
digital circuits in the cryogenic environment. 
The stable and reproducible operation of 
Josephson digital circuits have been reported, 
For example, if we used a Josephson multi- 
plexer, a time division multiplex could be 
made! , as shown in Fig. 4b). The outputs from 
many single-chip SQUIDs go to the Josephson 
multiplexer to be arranged in time division on 
a single output cable. 


3. Digital SQUID sensor 
3.1 Principle of operation 

In order to produce the pulse sequence in 
the cryogenic environment, A/D conversion is 
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necessary. The simplest way to do this is to use 
a Josephson comparator which receives the 
unkown magnetic flux and produces a 
comparator output”. 

3.1.1 Josephson comparator 

Figure 5 shows the Josephson comparator. It 
is a 2-junction interferometer. This comparator 
is used in the single-chip SQUID. It receives the 
unkown magnetic flux and produces a com- 
parator output in the form of a pulse sequence. 
The de SQUID is also a 2-junction interferom- 
eter. But unlike the dc SQUID, the comparator 
has no shunt resistor with Josephson junction 
and has a definite hysteretic J-V curve as shown 
in Fig. 6. When the bias current is lower than the 
threshold value, the comparator is in the zero 
voltage state. When the bias current is increased 
beyond the threshold value, it switches to 
the finite voltage state of the order of mV. 
The finite voltage state is maintained until 
the bias is lowered to less than the value, Jpin. 
It is typically much smaller than the threshold 
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Fig. 5—Circuit of a Josephson comparator. It is a 
2-junction interferometer without shunt resistor. 


m Ver. : 0.1 mA/div 
Hor. : 1 mV/div 


Fig. 6—I-V curve of the Josephson comparator. 
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value. 
1) Threshold curve 

The threshold value of the Josephson com- 
parator is a function of the input magnetic flux 
or the current J, flowing through a magnetically 
coupled line. Figure 7 shows the threshold 
curves of the 2-junction interferometer. It con- 
sists of the same closed shapes placed periodical- 
ly along the horizontal axis. The periodicity is 
the flux quantum ®,) which is defined by 
Planck’s constant over twice the electron’s 
charge, h/(2e), and is 2067 x 10715 Wb. Each 
closed curve is called a mode and is numbered 
thus —l-mode, O-mode, l-mode, etc. When 
the operating point is in m-mode, almost m flux 
quanta are entered in the interferometer. The 
theoretical curve on the bending line approxi- 
mation is shown in Fig. 7b). 
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Hor. : 0.25 Do /div 
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b) Theoretical (Theoretical curve is based on bending 


line approximation ) 


Fig. 7—Threshold curve of the Josephson comparator. 
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When the operating point crosses the solid 
threshold curve in Fig. 7b), the interferometer 
switches from the zero voltage state to the 
finite voltage state. On the other hand, when it 
crosses the dashed curve, the interferometer 


changes its state from one mode to another. 
The shape of the threshold curve depends 


on the critical currents (J; and J/,) of 
the Josephson junctions, the inductance L, 
and the ratio of right and left inductances, 
q/p (p+ q =1)'®. The coordinates of the two 
points shown in Fig. 7b) are exact values. The 
curve is symmetrical about its origin. The exact 
whole curve can be obtained to calculate the 
maximum bias current /, using the following 
two static equations. The first equation is 
Kirchhoff’s law. 


I, = 1, sing, +1 sing. 


The second equation is the quantum condi- 
tion. The increase of the phase of the super- 
conducting order parameter is permitted by only 
two times integer n along the superconducting 
closed loop. There are phase differences of $ 
and $2 at the two Josephson junctions. In the 
superconductors, the phase change is expressed 
by the following equation, 


g 21 Pore e OT? s 
a = Ko AL a. )i, wore ha) 
where A is the vector potential, Uo is the per- 
meability in vacuum, A; js the London penetra- 
tion length (85 nm for Niobium), i is the current 
density. The current density is negligibly small 
in the region far inside the superconductor. 
Then, by integrating Equation (2) on the path 
far inside the interferometer ring, we obtain the 
next quantum condition, 


20 
¢,; —o2 + — =2nz, 
Po 


® = pLI; sing, — qL1zsin¢d, — MI,, (3) 


where @® is the magnetic flux interlinked in 
the interferometer ring. 
2) Operation 

The alternating current (ac) bias is applied 
to the 2-junction interferometer. The bias 
amplitude is adjusted to the threshold value at 
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Fig. 8—Operation of the Josephson comparator. 
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Fig. 9—Measured operation of the Josephson comparator. 


the vertical axis as shown in Fig. 8. When the 
input is positive, the interferometer produces 
positive pulses. When the input is negative, 
a negative pulse sequence is produced. The pulse 
shape is definite due to the hysteretic I-V curve 
shown in Fig. 6. So, the interferometer dis- 
criminates between the positive and negative 
input. It operates as a comparator. 

The above is true for a large input. When the 
input is small, however, the fluctuation effect, 
such as thermal noise or external noise, should 
be considered. Due to this, even when the input 
was positive, a negative pulse sometimes 
appeared as shown in Fig. 9. 

As the photograph was taken of the oscillo- 
scope during many scans, the intensity rep- 
resents the probability of the pulse occurrence. 
Higher intensity means higher switching prob- 
ability. Therefore, the comparator output is not 
determined strictly by the input flux; only the 
switching probability is determined by the input 
flux. The switching probabilities, p* for positive 
polarity and p” for negative polarity, changes 
from zero to one in a smooth curve as shown in 
Fig. 10a). As a result, the average number of 
positive pulses minus the number of negative 
pulses, p’ — p-, becomes a smooth curve. Due to 
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Fig. 10—The switching probability p* for positive bias, 
p~ fornegative bias, and their difference p* —p-, 
for three bias amplitudes. 


the smoothness of the curve, it is not necessary 
to adjust the bias amplitude exactly to one point 
of the threshold value. However, it is necessary 
to adjust the bias amplitude to about one 
percent. If the bias amplitude is too large or too 
small, the interferometer becomes insensitive to 
small flux, as shown in Figs. 10b) and c). The 
derivative, 0p/d®, determines the sensitivity 
of the SQUID sensor and the response speed. 
Larger 0p/d® gives higher sensitivity and a faster 
response speed, which will be discussed later. 

Some other variations in the operation of 
the 2-junction interferometer can be considered. 
In the above discussions, the operating point 
moves along the vertical axis with bipolar ac 
bias. But it is possible to move the operating 
point along an axis other than the vertical. It is 
also possible to use monopolar ac bias. Further, 
the modulation of the input current with a 
different frequency enables us to operate it 
avoiding the insensitive region by sacrificing the 
response speed. 

3.1.2 Other methods of pulse output 

Other methods of digitalization have been 
reported. In the first report on the digital 
SQUID’, the operations of receiving signals 
and comparation were divided into two compo- 
nents, a dc SQUID and a Josephson comparator, 
which were integrated on the same chip. In this 
method, the dc SQUID receives the unknown 
magnetic flux and produces an analog voltage. 
The comparator receives it and produces a pulse 
sequence. This is sent to the digital counter at 
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room temperature, which produces feedback 
flux and sends it back to the dc SQUID. This 
method enables us to design optimally for each 
operation, but it requires two interferometers 
with dc bias and ac bias, respectively. The noise 
was limited by the dc SQUID, not by the 
comparator. 

Another comparator called DCFP or QFP!” 
has been proposed and tested. It is a 2-junction 
interferometer contained in a closed super- 
conducting ring. Its output is not a voltage, but 
a magnetic flux. When an excitation current is 
applied, it changes the small input flux to a large 
magnetic flux with the same polarity. To receive 
the magnetic flux change, a superconducting 
storage loop circuit has been proposed!®, This 
is required to control the shape of the short 
voltage pulse and to control the transient 
response for stable operation. It has similarities 
with the superconducting feedback circuit 
described in this paper, but the operation is 
different. Further research will be required to 
characterize this device for use as a magnetic 
sensor. 

In the following sections, we restrict our- 
selves to a discussion of the simplest digital 
SQUID sensor, the Josephson comparator, 
which receives the input flux directly. 


3.2 Sensitivity 

3.2.1 Switching probability 

In the case of the analog SQUID, the 
thermal noise appears as a noise voltage on the 
output? !), The output is the sum of a voltage 
due to the input flux and a voltage due to the 
thermal noise. In the case of the digital SQUID, 
the output is a pulse sequence and the thermal 
noise appears as a fluctuation of the output 
pulse. If we suppose there is positive input flux 
without noise, the digital SQUID sensor 
produces a positive pulse and no negative pulse 
at each bias clock. But, the real SQUID does 
have thermal noise, and as discussed in Subsec. 
3.1.1, a positive pulse is sometimes lost and a 
negative pulse is sometimes produced as shown 
in Fig. 11. The input flux only determines the 
probability of pulse occurrence. Switching 
probability, p of the digital SQUID sensor can 
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Fig. 11—The bias waveform and the output pulse of the 
Josephson comparator. 


be obtained as follows!”. 


The characteristics of the ideal Josephson 
junction is expressed by the following Josephson 
equations?” : 


ee ee ee (4) 
&, d 

gee: Oe a , eeene (5) 
2n dt 


where i is the current and Jp is the critical 
current of the Josephson junction, v is the 
voltage of the junction, @ is the phase difference 
of the order parameter in the Josephson junc- 
tion. The real Josephson junction has a junction 
capacitance and a conductance in parallel. Then, 
the circuit equations of the SQUID sensor are 


© do 
(—¥ (¢ Be gig ce 
2a dt dt ao) 
Bp , , ado doy aU 
an? * dt? a? do (6) 
where the potential U'® is expressed by 
Po Ip 
U ($1, $2) = = (2 — cos¢, — cos¢>) 
ee eee, qt . 
lt Ha ht aes 2 
Po 
+ ———_(@; —@o)"}, wee ee (7) 


4nLIp 


where we assume two Josephson junctions in the 
digital SQUID sensor have the same critical 
current Jj, capacitance C, and conductance 
G. These equations are the same as those for the 
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Fig. 12—A potential U(¢,, ¢2) calculated numerically for 
parameters of LJp =0.33%), P=0, p=0,q =1, 
and J,/Ip = 1. 


motion of a ball in 2-dimensional potential 
U (¢,, 62). A potential map for a certain bias 
current J, is shown in Fig. 12. When a positive 
I, is applied, the potential decreases along the 
(p¢, + qd.) axis with undulation due to the 
cos@ term in the potential Equation (7). For 
small bias /,, the potential has periodical local 
minima. In the mechanical analog, the ball is 
trapped in one of the minima. As the bias is 
increased, the potential barrier becomes smaller. 
If there is no noise, the ball remains in the 
minimum. When the bias current is applied 
beyond the threshold value, the potential barrier 
is lost and the ball starts to move continuously 
toward the lower potential. Phases ¢, and @, 
increases continuously. From the Josephson 
Equation (5), we can see that this corresponds 
to the finite voltage state. In this case, the digital 
SQUID sensor switches from the zero voltage 
state to the finite voltage state. 

Due to thermal noise, the ball moves 
randomly in the neighborhood of the potential 
minimum. Even when the bias current is lower 
than the threshold value, it has a finite prob- 
ability of overcoming the potential barrier. This 
probability p is the switching probability of the 
digital SQUID sensor. 

According to the mechanical analog, the 
hysteretic I-V curve can be understood as 
follows. As we do not shunt the Josephson 
junctions with low resistors, there is only a small 
loss. Then, once the ball has overcome the 
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potential barrier, it continues to fall toward the 
lower potential. Due to the ball’s inertia, the 
voltage state is maintained until the bias 
decreases below the low value, Jmin. Due to the 
hysteresis, the digital SQUID sensor produces 
a definite pulse. 

In the following discussion, we assume that 
the bias waveform is a pulse as shown in Fig. 11. 
The switching probability was obtained on the 
basis of the transition state method?” for one 
dimensional consideration as 

_AU 

Poy =e KT eC | HR BRS ES (8) 

p=1—(1—po)**, 
where AU is the potential barrier, k is the 
Boltzmann constant, 7 is the temperature, 
fy is the Josephson’s plasma frequency 
1=(1/2m)/ 2lo/(@oC)}, and r is the bias 
pulse width. 

Equations (8) and (9) lead us to the following 
reasoning. The ball in the mechanical analog 
oscillates in the potential minimum with a 
frequency of the Josephson plasma frequency 
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Fig. 13—Calculated switching probability vs normalized 
bias amplitude ig (=/,/Io). 
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fp. At every oscillation, the ball attempts to 
overcome the potential barrier. po is the 
probability of overcoming the potential barrier, 
AU, at each attempt of the Josephson oscilla- 
tion. fpt attempts determine the probability of 
switching p in one bias clock. 

When operating in a feedback loop, the input 
flux is around zero; this is sufficient data for us 
to calculate the probability at zero input. The 
potential barrier AU can be obtained by 
numerical calculation of Euation (7). Using 
Equations (8) and (9), the switching probability 
p vs the normalized bias amplitude i, is calculat- 
ed as shown in Fig. 13. The numerical calcula- 
tion of the 0p/0® is also shown in Fig. 14. 

In the above discussion, the probability was 
obtained by the approximation of one-dimen- 
sional transition state method. By extending the 
transition state method?” to the two-dimen- 
sional lossless case, we obtain 


QQ, _AU 
Po = Sfp an eee (10) 
te i he re (11) 


where a, and a, are the square roots of curva- 
tures in the two orthogonal axes at the potential 
minimum, and y, is the square root of curvature 
at the saddle point of the potential barrier along 
the axis normal to the ball’s overcoming move- 
ment. But the calculated result is nearly the 
same as that shown in Figs. 13 and 14. There is a 
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result of about ten percent change in the mini- 
mum flux spectral density. 
3.2.2 Power spectral density of output pulse 
For simplicity, we suppose the output pulse 
is a delta function. The output waveform x (f) is 
written as 


x (t) = [xi 8 — np) — xq 8 ft 


1 1 
37) |. Ly Ae 


«, (138 
fs 


eo oe 
Xn — Pn; 


where bar means the ensemble average. This 
approximation causes an error in the spectrum 
only at a frequency higher than the bias fre- 
quency fg. When we want to know the Fourier 
component at a low frequency region less 
than the bias frequency fg, this approximation 
is not problematic. The ensemble average 
of the Fourier component of the output pulse 
in the time period of 7, Xy7 and the power 
spectral density S, can be obtained by the 
following equations, 


T/2 ; 
Xr) = J xe dt, w= Inf, (14) 


2|X7 (f) |? 


eres 
F , (15) 


S,(f) = lim 
To 
where S, is defined for positive frequency 
f. If the bias amplitude and the signal flux 
do not vary with time, the switching probability 
py, is independent of n. This instance is dealt 
with in Ref.19. Now we wish to consider 
a more general instance, where a small signal 
flux ®,(¢) is applied. Then, the switching 
probability depends on time as expressed by 


Op 


Pr ~ P#(SZ bln). -. 16 
The quantity d, is defined by 
| 7/2 
% (t) =— ®,(t+t')dt’, eee GUA) 
io -+f™ a, 


which has the following Fourier transform. 
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, is the average of ®, at each time. The average 
is calculated by the time period of bias pulse 
width 7. This comes from the fact that the 
switching probability p is determined by all 
of the attempts in a bias clock. This averaging 
effect reduces the high frequency component 
of the signal flux spectral density. The reduction 
is a factor of {sin (w7/2)/(wr/2)|? as shown 
in Fig. 15. 

From Equations (12) to (16), we obtain 
the power spectral density of the output pulse 
X as follows. 


a) 
Sx (f) = 4fgp(1 — p) ~ 4fa GE) 0800) 


+8f,p? > 5|f—(2n+1)fp} 
a = 
nr psy E: Sa(f + 2nfg), (19) 


where ¢ % is the autocorrelation function of the 
d. Sg is the power spectral density of d,, 
In the right side of Equation (19), the first 
and third terms represent the thermal noise. 
The second term comes from the probability 
change due to the signal flux. This is small 
compared with the first term, if the signal 
is small enough. The fourth term comes from 
the signal flux. 

Equation (19) shows the following im- 
portant points. 
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Fig. 16—Spectral density with aliasing effect. 


1) Thermal noise appears as white noise in 
the frequency region below the bias fre- 
quency fg. It is represented by the first 
term on the right of Equation (19). 

2) Signal flux spectrum appears in the 
output, as multiplied by the factor of 
| sin (w7/2)/(wr/2)}?, due to the averaging 
effect. 

3) Signal spectrum appears periodically in 
the output spectral density, with the period 
of 2f, as shown in Fig. 16. This is the 
same as the aliasing effect that appeared 
in sampling. The digital SQUID sensor 
operates as a sampler. 

3.2.3 Thermal-noise-limited sensitivity 

The biomagnetic signal has frequency 
components from 0.1 Hz to 1000 Hz, while 
the bias frequency is typically from 1 MHz 
to 100 MHz. It is sufficient for us to attend 
to the component at a frequency lower than 
the bias frequency fg only. In this case, the 
relationship between the spectral density of 
the output pulse and the signal flux is expressed 


by the coefficient 4f? (dp/d@)? of the fourth 


term on the right side of Equation (19). 

By dividing the thermal noise 4fgp(1 — p) 
by the above coefficient, we obtain the 
equivalent flux spectral density of the thermal 
noise as follows!” . 


As we cannot detect a signal flux smaller 
than this spectral density, this is the funda- 
mental limit on the sensitivity of the SQUID 
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sensor. It is determined by the switching 
probability p and its derivative dp/d®. Figure 17 
shows the numerical calculation for inter- 
ferometer inductance L of 6.9 pH and tempera- 
ture at 4.2 K. Larger L gives lower sensitivity. 
For example, the minimum flux spectral density 
was obtained as 3 x 107° ®,/Hz!? for 6.9 pH 
with the bias pulse width of 1 ns and bias 
frequency at 1 MHz. However, for 15 pH it 
becomes as large as 5 x 107° @,/Hz!?. 

The potential barrier AU cannot be ex- 
pressed analytically. But, if we confine ourselves 
to the small inductance, an approximate ex- 
pression can be obtained. In the limit of zero 
inductance, the 2-junction interferometer can 
be used as a single Josephson junction. In 
this case, AU, and therefore, p, 0p/d®, and S» 
can be expressed analytically. The minimum 
flux sensitivity is expressed as follows)”. 


Po al (ae 


S Lis =K 
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The minimum condition is obtained for the 
bias amplitude that has a switching probability 
of Pm (= 0.79). The value K is 0.05-0.13 for 7 of 
1073-107? s and can be considered as nearly 
constant. Equation (21) gives 1.4 x 10°° ®p /Hz!?, 
which is about half of the numerically calculated 
result shown in Fig. 17. This is the approxima- 
tion error. However, Equation (21) is useful 
as it allows us to know the rough dependence 
of the sensitivity on parameters. The flux 
spectral density is inversely proportional to the 
bias frequency. It is also proportional to four- 
thirds of the temperature and is inversely 
proportional to four-thirds of the critical 
current. This temperature dependence might 
seem ridiculous, but it is due to the nonlinear 
dependence of the switching probability, and its 
flux derivative. 

When the temperature is much decreased or 
the bias frequency much increased, the thermal 
noise lowers. But it is limited by another 
condition called the quantum limit. 

3.2.4 External noise effect 

External noise contributes to the flux 
spectral density. If it is larger than the thermal 
noise, the sensitivity becomes limited by the 
external noise. The external noise can enter both 
as input flux and as bias fluctuation. As for 
the input flux, this has already been treated 
by the second and the fourth terms in 
Equation (19), When there is a fluctuation 
on the bias, it can be treated in the same way. 
By adding the bias fluctuation effect to the 
probability, Equation (16) is replaced by 


dp 
dig 


0 ~ Ps 
g@ = p* (5b (nT) * ( Vig(nTp ). 


The term ip is defined in the same way 
as in Equations (17) and (18). The frequency 
component of the bias noise power at high 
frequency is also reduced by the factor 
| sin(w7/2)/(w7/2)}? due to the averaging 
effect. The spectral density of the output pulse 
is obtained as 
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0 
Sx (f) = 4fap (1 ~p) ~ 4fe (55960) 
JOP ae 8 
~ 4fn(5-? ie ©) 
+8fgp°?= A{f—Qnt 1)fp} 
a _ 
+4F5 (SSPE S8 (f+ 2nfp) 


+4f2CP pS se (f+ 2mfg). (23) 
B dig n=-00 & 


The bias noise also suffers the averaging 
effect and the aliasing effect, as with the 
input flux noise. It appears in the equivalent 
flux spectral density as multiplied by 
(dp/a®)? /(dp/di,)*. Roughly speaking, it is 
the slope of the threshold curve and is about 
(1/E)*. 

In order to suppress external noise, the bias 
and the signal with a frequency component 
higher than fg should be cut with a low-pass 
filter. 


3.3 Structure 

In its early use”, the Josephson junction 
was made by means of point contact. But now, 
for practical use, SQUID sensors have been 
made using thin film technology. Figure 18 
shows the sensor structure. It is the same as 
that of the dc SQUID proposed by Ketchen”, 
except that it has no shunt resistor. The struc- 
ture is a superconducting thin film transformer. 
It has a single-turn superconducting inductance 
covered by multi-turn squre coils above the first 


-Input spiral coil 


Josephson 
junctions 


Bias current 


Lg? 


—— Output 


Fig. 18—Structure of the SQUID sensor. 
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Nb counter electrode 


AlO, tunneling barrier 


Nb wiring 


600 nm 


Nb ground plane 


Si substrate 


Fig. 19—Cross-sectional view of the multilayer structure. 


coil. These layers can be turned upside down. 
The multi-turn coil is called the input coil. 

The sensor is made by multilayer fabrication 
in the same process as the Josephson digital 
circuits! . 
in Fig. 19. The self and mutual inductances 


The cross-sectional view is shown 


are calculated by the following equations”. 
Pes ZS sss a Bra rangienn (24) 
YS eae) er re er (25) 
bey = EE sa bv ewww d (26) 


where n is the number of turns. Equation (24) is 
valid for w/d much larger than 1. L, comes 
from the flux uncoupled to the SQUID 
inductance ZL. This is the flux interlinked in 
the insulator between the multi-turn coil and the 
SQUID’s single-turn coil. L, is calculated by 
the following equation of the superconducting 
strip line. 


— Mo (At aAp* +az,*)ls 
Kw 
ii 


Ls 


= Ted. (27) 


AL} = AL; coth ( 


l 
where / is the thickness of the insulator, w is 


the width of the multi-turn coil, K is the fringe 
factor’? , ¢; and Ay, are the thickness and 
London penetration depth of the single-turn coil 
(i=1), and the multi-turn coil (i = 2), respec- 
tively. /, is the length of the multi-turn coil. 
The fringe factor K can be approximated by 


(1 + 4h/w). Typically, L, is much smaller than 
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a) First order b) Second order’ c) Planar first order 


Fig. 20—Three kinds of pickup coils of gradiometer. 


n?L, the coupling coeffient k? can be as high as 
0.9. 


3.4 Coupling to the input flux 

3.4.1 Pickup coil 

The superconducting pickup coil receives the 
unknown magnetic field. It is made with super- 
conducting wires or thin film line patterns”?. 
Typically, their dimensions are 2 cm or 3 cm in 
diameter. The magnetic field measured is of the 
order of 10 fT, which is about six orders lower 
in magnitude than the environmental magnetic 
field fluctuation. To pick up this low magnetic 
flux, electromagnetically and magnetically 
shielded room are used. But, because it is not 
easy to obtain the shielding factor of 10° by 
them, a gradiometer is commonly used”. The 
first order gradiometer has two coils placed in 
parallel which are anti-wound to each other as 
shown in Fig. 20a). It is insensitive to the 
uniform field and is more sensitive to the field 
varying in space. As the signal source is placed 
near the pickup coil, and the source of the 
environmental field is generally far from the 
pickup coil, the environmental field varies more 
slowly than the signal field that is being meas- 
ured. So, the gradiometer less sensitive to the 
environmental field. The second order gradiom- 
eter shown in Fig. 20b) is also used to obtain 
higher suppression of the environmental field, 
but is less sensitive. Usually, the coil imbalance 
can be made as small as 1/1 000. 

Typical gradiometer coils have been made 
by winding a superconducting wire around a 
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a) Direct connection 


b) Fractional-turn 


c) Use of intermediate transformer 


Fig. 21—Various methods for inductance matching. 


cylinder. The planar coil configuration with 
superconducting wires as shown in Fig. 20c) 
has also been proposed and tested. 

For strict balance of the gradiometer coil, 
it is useful to make the coil with the lithography 
technique. Some authors reported that the pick- 
up coil was integrated on the same chip!3))25)_ 
Nakanishi?*? reported that a couple of dc 
SQUID magnetometer chips with a single-turn 
pickup coil were used, and in this case their 
output data is subtracted to suppress the envi- 
ronmental field. 

3.4.2 Direct connection 

In many cases, the pickup coil is connected 
to the input coil of the SQUID as shown in 
Fig. 21a). To maximize the transmitted flux, the 
input inductance L;, is designed to be the same 
as that of the pickup coil L,. The pickup coil 
has a relatively large inductance of the order 
of 1 wH. However, the thin film SQUID induc- 
tance is designed to be from 10 pH to 100 pH. 
In order to match these inductance differences, 
the number of turns n has been designed as 
typically a few tens or 100. 

The maximum transmission coefficient, 
which is defined as the ratio of the flux inter- 
linked with the SQUID sensor ® to the flux 
interlinked with the pickup coil ®, is obtained 


as 
/L = 


(Lp - Lu) 


= (28) 
®, on * 
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As the input coil is shunted by the pickup 
coil, the inductance of the interferometer 
becomes L{1 —M? /(Li, + Ly)} instead of L. 

3.4.3 Fractional-turn 

The fractional-turn type has been proposed 
and tested as an alternative to using one super- 
conducting inductance?®), The inductance of 
the SQUID sensor is divided into many coils 
as shown in Fig. 21b). The maximum flux is 
transmitted, when the next equation is satisfied. 


Dorin MME Bane (29) 
The transmission coefficient is 

® 1 ie 1 

—_—= -/— = : ow aloo) 

®, 2 his 2mn 


Compared with the direct connection, this 
method helps reduce the number of input 
coil turns n. 

The fractional-turn type is reported to have 
a resonance problem, because the many-turn 
coils have a larger parasitic capacitance?” The 
resonance deforms the I-V characteristics of 
the dc SQUID and decreases the sensitivity. 
In this case, a Josephson junction is shunted 
by a low resistor, and the I-V curve is non- 
hysteretic. But in the case of the digital SQUID, 
the interferometer has no shunt resistor and its 
I-V curve has a large hysteresis as shown in 
Fig. 6. As the interferometer is used to switch 
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from the zero voltage state to the finite voltage 
state with a larger load resistor, the operation 
is not affected by the resonance. 

3.4.4 Intermediate transformer 

Another alternative is the intermediate 
superconducting transformer, as shown in 
Fig. 21c). The next condition does not maxi- 
mize the transmission coefficient, but is near 
to this maximum condition. 


Ey Lg MPI OP Wk, aeizrs (31) 
The transmission coefficient is expressed as 
oy 1 E ] 
SN a ae eee (32) 


6 4%%, ah 


The use of the intermediate transformer reduces 
the transmission coefficient by a factor of 2. But 
the transformer is capable of adjusting the 
input inductance without changing the SQUID 
sensor. Knuutila?” reported that the inter- 
mediate transformer is used to attach the input 
coil, made on a different substrate, to the 
SQUID chip. 


4. Digital feedback loop 
4.1 Principle of operation 

The digital SQUID sensor is operated in a 
digital feedback loop. In this loop, the digital 
feedback circuit receives a pulse sequence and 
produces a feedback flux. The transmission 


Output 


Feedback circuit 


Feedback 


Fig. 22—Operation of the digital SQUID. 
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function of the feedback circuit relates to the 
response speed of the feedback loop. A simple 
choice in this case is integration. 

In this case, the digital feedback  cir- 
cuit is designed to operate as an up/down 
counter® 13). Figure 22 illustrates the feedback 
operation. If the input rises in a step function, 
the SQUID sensor produces positive pulses. 
The feedback circuit counts this pulse sequence 
and sends back the feedback flux to the SQUID 
sensor with negative polarity. The SQUID 
sensor receives both the input flux and feed- 
back flux. The SQUID sensor measures the input 
flux minus the feedback flux. This decreases 
as the feedback flux increases. When it becomes 
zero, the SQUID sensor stops producing a 
positive pulse. To be precise, the SQUID sensor 
produces both positive and negative pulses 
with equal probability, and the average feedback 
flux does not increase any more. In this way, 
the feedback loop operates to maintain the total 
input flux at zero. We can know the input flux 
by measuring the pulse sequence. For example, 
there are four positive pulses in Fig. 22, so we 
know the input is four times the quantized flux. 
This scheme is the same as that of the delta 
modulation?®?. 

When the input is cancelled by the feedback 
flux, the sensor produces both positive and 
negative output pulses with the same polarity, 
typically adjusted in the range from 0.5 to 
0.8. That can be changed by adjusting the bias 
amplitude as discussed in Subsec. 3.2.1. 

The dynamic range of the SQUID operation 
is limited by the capacity of the feedback 
circuit to count the pulse. To measure the input 
waveform with adequate resolution, a dynamic 
range of 10° seems to be sufficient. But in the 
practical measurement of biomagnetism, the 
presence of external noise means that a larger 
dynamic range is necessary. The dynamic range 
of 2!®, that is, about 5 x 105, will be sufficient. 


4.2 Response speed 

4.2.1 Maximum slew rate 
1) Fixed step size 

Response speed depends on the quantized 
feedback flux A® and the bias frequency fg. 
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When the input flux rises in a step function 
with the amplitude much larger than the noise 
level, the feedback flux increases at a constant 
rate as shown in Fig. 22. This rate is the maxi- 
mum slew rate®) and is expressed as A®f,. It 
can be increased by increasing the bias fre- 
quency or the quantized feedback flux. But, 
if the quantized flux is larger than the noise 
level of the sensor, the quantization error 
will limit the sensitivity. In order to obtain high 
sensitivity, the quantized feedback flux should 
be near the noise level of the SQUID sensor. 
However, it is possible to increase the maxi- 
mum slew rate, by sacrificing the sensitivity. 
2) Adaptive modulation 

To improve the slew rate without sacrificing 
the sensitivity, the use of variable step size, 
or adaptive modulation, has been used and 
tested?®):?® In this method, the quantity of 
the quantized feedback flux is changed by the 
characteristics of the pulse sequence. If only a 
positive pulse continuously arrives, this means 
that a large positive signal flux is applied to the 
SQUID sensor. In this instance, the quantity 
of the feedback flux for each arriving pulse 
is increased. Experimentally this method has 
been shown to improve the maximum slew 
rate by three orders of magnitude. This has 
been done using the external digital feedback 
loop, but it could be replaced by Josephson 
logic circuits. 

4.2.2 Cutoff frequency 

Due to the aliasing effect discussed in 
Subsec. 3.2.2, the output pulse has the signal 
flux spectrum shifted by integer times twice 
fg. But the bias frequency is typically more 
than 1 MHz and is much higher than the signal 
band width which typically ranges from 0.1 Hz 
to 1000 Hz. So, when we confine ourselves to 
the low frequency region below the bias fre- 
quency and small signal, the SQUID sensor is 
equivalent to a linear transducer with the 
transfer function of 2fg (dp/d®) as shown in 
Equation (19). Thus, a block diagram of the 
feedback circuit can be drawn as shown in 
Fig. 23. The relationship between the input flux 
®, — Pppz and the output pulse X is obtained as 


74 


Bias frequency fe 


Input . 
Os(f) — Ors(f) 
+ sensor 


Output 
x(f) 


Unknown 
magnetic 
flux Os(f) 


2fa oF 


Reproduced 


Feedback waveform Y (/) 


magnetic 
flux Ors(f) 


Feedback 
circuit 


A 
lw 


Fig. 23—Block diagram of the digital SQUID for small 
input flux. Only the frequency component- 
below the bias frequency fg is considered. 


op 
X(w) = 2fp(ss) 1Ps(w) — Pra (w)} - 
The feedback circuit operates as a counter and 
is expressed as 


A® 
Ppp (w) = 7 X (w). 


Substituting Equation (34) into Equation (33), 
we obtain 


®.(w) 
co) = 
FB (@) 1 +iw/w, 
2f age. (35) 
Wo = a Rote e nates: 
: Be ae 


If we count the output pulse at room tempera- 
ture, the counted result Y is proportional to 
@rp and is expressed as 


A 
Y(w) = ~~ Orp (w) 


A® 
A ®.(w) 
A® 1 +iw/w,’ 
where A is a unit quantity for one pulse. If 
we use a D/A converter to measure the counted 
result A is the l-bit output voltage of the D/A 
converter. 

This shows that the feedback system 
operates as a first-order low-pass filter. The cut- 
off frequency is A@fg |(1/7)(dp/d®)} . 


axis (OO) 
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Fig. 24—Superconducting feedback circuit. 


5. Single-chip SOUID 
5.1 Superconducting feedback circuit 

The digital feedback circuit can be made 
with an up/down counter and a digital-to-analog 
converter as discussed in Sec. 4.1. It can be 
realized with room temperature electronics 
or cryogenic electronics, as for Josephson 
logic circuits. However, a further alternative 
method has been proposed and tested®. In 
this method, the operation of pulse counting 
is done by placing a quantum flux in the storage 
loop. Figure 24a) shows the circuit. A super- 
conducting inductance and write gate of a 
2-junction interferometer form a loop for flux 
storage. 

5.1.1 Flux quantization 

This method utilizes flux quantization®™. 
In the superconducting loop, the magnetic 
flux is quantized. The flux is permitted when, 

Oi NDRes $$ ©  Siaktewesé (37) 
where n is an integer. In a superconducting 
loop including Josephson junction, or inter- 
ferometer, the above quantization condition 
reappears as, 


Po o+b=n®, 

20 
where @ is the phase difference of the Josephson 
junction or the interferometer. Due to the 
limitation of dynamic range by the write gate 
operation, the first term is less than ®,/40, as 
discussed in the next subsection. For practical 
applications, this small difference of less than 
one quantized flux is not a problem. In the 
case of exact measurement requiring an estimate 
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Fig. 25—The operating trajectory on the threshold curve 
of the write gate. 


of this difference, it is possible to calibrate 
the value. 

5.1.2 Write gate 

The write gate® in Fig. 24a) is a 2junction 
interferometer. It is used to add flux quantum 
when a pulse arrives. Its operation is shown in 
Fig. 24b). Figure 25 shows the threshold curve 
of the write gate. When the pulse comes from 
the sensor, the pulse current flows through 
a magnetically coupled control line and is then 
injected at the left end of the interferometer 
inductance. The operating point moves from 
the origin to point A in Fig. 25. As the operating 
point crosses the threshold curve, it changes 
its state from the zero mode to the first mode, 
At this point, the left Josephson junction J, 
switches transiently and as a result, nearly one 
flux quantum enters the interferometer. When 
the pulse falls, the point returns to the origin. 
In this case, when the operating point crosses 
the dashed curve of the first mode, the right 
Josephson junction J, switches transiently, 
and the flux quantum is transferred to the 
superconducting loop. When the operating 
point returns to the origin, the stored flux is 
exactly one flux quantum. 

To be precise, the operating point does 
not return exactly to origin 0. Due to the non- 
linear inductance of the write gate having a 
Josephson junction, the stored flux is slightly 
different from the flux quantum. The difference 
is expressed by Equation (38). However, it is 
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small, because the loop inductance is of the 
order of 10 wH and is much larger than the inter- 
ferometer inductance of about 10 pH. When the 
critical current flows, the first term in the left 
side of Equation (38) becomes ®,/4. But, even 
when the maximum number of flux quanta 
is stored in the loop, the maximum circulating 
current is about 1/10 of the critical current 
of the write gate as discussed in the next sub- 
section. In this instance, the error is less than 
1/40 of the flux quantum, and can be neglected 
in usual use. 

5.1.3 Dynamic range 

The dynamic range is the same as the 
maximum number of flux quanta. The capacity 
of storing flux quanta is determined by the 
following effect. When one flux quantum 
enters the superconducting storage loop, the 
flux is maintained in the loop by the external 
current. The current shifts the operating point 
trajectory along the vertical axis. As the number 
of the positive stored flux increases, the 
trajectory is shifted down. When it becomes 
the dashed line 0'A’ in Fig. 25, it crosses point C. 
At this point, the interferometer can add no 
more flux quanta. Actually, in this case, when 
the pulse rises, J, switches transiently and 
nearly one flux quantum enters the _ inter- 
ferometer, but when the pulse falls, the same J, 
switches transiently and the flux moves back 
to the left. No flux is added in the super- 
conducting storage loop. On the other hand, 
when many negative pulses continue, the 
operating point shifts upward and if it crosses 
point D, no more negative flux quantum can 
be added. The maximum circulating current 
Ime, Which is defined as the point when no 
more flux can be added, is value 00’. It is 
determined by the write gate design and is 
typically 0.02 mA. The maximum amount of 
stored flux is denoted by Lelmc/Pp. For 
example, 105 for Ly of 10 wH*”. 


5.2 Chip design 

The single-chip SQUID consists of a digital 
SQUID sensor and a superconducting feedback 
circuit as shown in Fig.26. The quantized 
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Fig. 26—Circuit of the single-chip SQUID. 
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Fig. 27—Cover range of the single-chip SQUID. 


Table 1. Parameter dependence of the performance of 
the single-chip SQUID 


Item Specification 
Sensitivity Ab~VJ SoAf 
Dynamic range MgIlmc/A®P 
Response speed 

Slew rate") Az 
Cutoff frequency Ad fp(— oe ) 


Note: Constant step size: AD = (M¢/Lp)Po. 


feedback flux per one pulse is A®. It can be 
adjusted by designing mutual inductance Mr. 
Figure 27 shows the coverage of the single- 
chip SQUID as a function of M;, with a 
parameter of the inductance Lr of the storage 


FUJITSU Sci. Tech. J., 27, 1, (April 1991) 


N. Fujimaki: Josephson Integrated Circuits I: A Single-Chip SQUID Magnetometer 


loop. The upper limit is Mplm,/® 9. The lower 
limit is determined by the quantized flux or 
the thermal noise. 

Table | summarizes the parameter 
dependences of the sensitivity, the dynamic 
range, the slew rate, and the cutoff frequency 
discussed in Sec. 4. The sensitivity is determined 
by the larger values of the noise level and the 
quantized flux. They are designed to be nearly 
the same. The dynamic range is the same as 
the maximum amount of flux quanta that 
can be stored in the storage loop. The slew 
rate and the cutoff frequency can be increased 
by increasing the bias frequency and the 
quantized feedback flux. The cutoff frequency 
is also proportional to the flux derivative of 
the switching probability, dp/d®, which, in 


Feedback 
circuit 


Feedback 


Superconductive loop 


Fig. 29—Circuit of the improved single-chip SQUID. 
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turn, depends on the bias amplitude, the bias 
pulse width, the critical current of the 
Josephson junctions, the attempt frequency, 
the inductance of the sensor, and the operating 
temperature as discussed in Subsec. 3.2.1. 

5.2.1 First chip 

The first chip® includes a figure-of-eight 
coil shown in Fig. 28. The circuit in the chip 
is the same as that shown in Fig. 26. The 
storage inductance Lr is 20 nH. It is made with 
a superconducting strip line with a width of 
5 um on the ground plane. The length is 13 cm, 
The dynamic range is about 200, which is too 
small to be used in practical applications, 
but is sufficient to investigate the fundamental 
operation. The sensitivity was obtained as 
7x 1075 ®)/Hz'” with the bias frequency 
at 500kHz. This value is worse than the 
theoretical value of 4x 107° @)/Hz'’”. The 
cause of this is believed to be external noise. 
But, for the first time, a digital SQUID sensor 
with a superconducting feedback circuit 
integrated on the same chip has been success- 
fully operated. 

5.2.2 Improved chip 

Improved performance, and the practical 
use of SQUID in the field of biomagnetism, 
were the aims in mind when the second chip 
was designed and tested?) The improved 
chip circuit is shown in Fig. 29. Figure 30 
is a photomicrograph of the chip. The pickup 
coil is not contained in the same chip. It is 
intended that the superconducting wire of the 
first or second gradiometer pickup coil will 
be attached to the chip pads. 

The intermediate transformer is used to 
match the input inductance to the pickup coil. 
The feedback transformer is also used to adjust 
the quantized feedback flux A® to be 
5.6 x 10-° @,. 

The storage loop inductance was increased 
to about 30 uwH. It was realized by 32 coils 
without any ground plane. The theoretical 
dynamic range is 1.7 x 105. Because the circuit 
has no ground plane, it might suffer from the 
external magnetic field or the stored flux- 
produces a magnetic field. To prevent this, 
the 32 coils are arranged as shown in Fig. 31. 
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Fig. 30—Photograph of the improved single-chip SQUID. 
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Fig. 31—An arrangement of the 32 coils of the super- 
conducting storage inductance. 


The field produced is proportional to r~ ® where 
r is the distance from the center of the coils 
to the origin of the field. By simple estimation, 
the coils produce only 10 fT at a point about 
2 cm far from the coil, even when the storage 
loop holds 10° flux quanta. 

The Josephson logic gates are used in the 
same chip. The gates are one of the MVTL 
gate family?” and a driver gate. The gates in 
this chip are used only as an output buffer 
which prevents the SQUID sensor from suffering 
form the external noise coming through the 
output cable. In the future it will be possible 
to add more complex logic circuits, for more 
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Fig. 32—Measured operation of the single-chip SQUID. 


data processing. 


5.3 Performance 

Figure 32 shows the results of our experi- 
ments*). The rectangular waveform current 
is applied to the chip as an input. The bias 
frequency was 1MHz. The output pulse 
sequence was obtained as shown in the middle 
of the figure. By counting the output pulse, 
the reproduced waveform was obtained. When 
we increase the rise of the input, the reproduced 
waveform rises as a constant slope at first, 
and then the increasing rate saturates, as shown 
in Fig. 33. This saturation is confirmed to be 
exponential as discussed in Subsec. 4.2.2. 

From the measured time constant, we 
obtained the flux derivative of the switching 
probability. The switching probability itself 
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Fig. 33—Enlarged photograph of the reproduced wave- 
form in Fig. 32. 
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Fig. 34—Measured switching probability p vs normalized 
bias amplitude ig. 


has been measured by changing the bias 
amplitude. The measured p and dp/d® are 
plotted in Figs. 34 and 35. In the experiment, 
the bias was applied as a rectangular pulse wave- 
form having a slight ringing with a peak at the 
rise of the pulse. The experimental data fits with 
the theoretical curve with 1 ns pulse width at 
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Fig. 35—Measured 0p/0® vs normalized bias amplitude i,, 


,Cutoff frequency 18 Hz 
/ estimated from rise time 


== 1.4, & 10° 
®0/Hz 


Frequency (Hz) 


Fig. 36—Measured flux spectral density. 


one point, as shown in Fig. 34. The theoretical 
calculation in Figs. 13 and 14 shows that the 
shape of p and dp/0® curve does not depend on 
T, but only shifts along the horizontal axis. The 
reason for the difference between the experi- 
mental data and the theoretical calculation is 
not yet known. Some external noise might con- 
tribute to this difference, but further study of 
this is required. 

Figure 36 
spectral density. It has a constant level of 
1.4x 1075 ®)/Hz’? at low frequency. It 
decreases at a rate of 20 dB/decade at higher 
frequency. It is in agreement with the response 
speed of the feedback loop with 18 Hz. The 
results of the experiment are summarized in 
Table 2. 

The sensitivity and the response speed 
are improved by increasing the bias frequency 
as discussed in Secs.3 and 4. From the above 
data, the performance at 100 MHz is estimated 


shows the measured flux 
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Table 2. Comparison of the experimental and the 
theoretical performance of the single-chip 


SQUID 
Performance Experiment Theory 
Bias frequency 1 MHz 1 MHz 
Dynamic range +2 x 10 +1.7x 105 


ne 1.4x10°@9/ | 2.7x 10° 
Sensitivity Hz!/2 9 Hz1/2 o/ 
Response speed 


Maximum slew 


rate 4 o/s 5.6 Do/s 
oe sai 18 Hz 320 Hz 
requency 


Table 3. Comparison of the expected performance of 
the single-chip SQUID and the top performance 
of the de SQUID 


Performance Single-chip SQUID de SQUID 
Bias frequency 100 MHz de 
Dynamic range +105 +10° 

-7 -6 -7 
Sinsitivity 3 in ie ap Po/ 
note) a a 
Response speed 
Maximum slew 
sak 400 ®y/s 10° @,/s 
Cut off 
frequency 1.8 kHz a 


Note: Constant step size. 


and compared with the best performance 
of the dc SQUID shown in Table3. The 
dynamic range and the sensitivity are the 
theoretical values. The response speed is extra- 
polated from the 1 MHz experimental results. 
The digital SQUID has the same level of sensi- 
tivity and dynamic range as the de SQUID. 
For practical use in biomagnetism, the sensitivi- 
ty of the conventional dc SQUID is of the order 
of 107° @)/Hz'*. Therefore, a bias frequency 
of 10MHz should be sufficient. As for the 
response speed, the digital SQUID with fixed 
step size is slow compared with the dc SQUID. 
This is due to the fact that the feedback flux 
increases by one quantized flux at most in 
one clock cycle. To improve the slew rate, 
it is possible to use the variable step size or 
the adaptive modulation as described in 
article 2) of Subsec. 4.2.1. 
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5.4 Unsolved problems 

Two problems remain unsolved. They are 
flux trap and crosstalk. The flux trapped in 
the feedback circuit reduces the dynamic range. 
The write gate has been improved and is now 
free of flux trapping. The persistent current 
in the superconducting storage loop, due to 
trapped flux, can be reset by heating with the 
resistor. However, in the sensor itself it is 
not easy to remove trapped flux. The static 
trapped flux can be cancelled by the internal 
feedback operation, although this reduces the 
dynamic range. However, the movement of 
the trapped flux produces extra noise in the 
sensor. That is a serious problem. 

Crosstalk is becoming the greatest obstacle 
to attempts to increase the bias frequency 
of the digital SQUID. The sensitivity and the 
response speed can be enhanced by increasing 
the bias frequency. Typically, when we increase 
the bias frequency higher than 1 MHz, crosstalk 
affects the SQUID performance. Crosstalk 
alters the operating point of the sensor or the 
other circuit element, and causes the wrong 
circuit operation. For much higher frequency, 
the multiple reflection of the pulse and its 
crosstalk determine the switching probability. 
When this occurs, the switching probability 
does not depend on the signal but on the former 
clock output. In the worst case, the SQUID 
becomes insensitive to the signal. 

If the above two problems, flux trap and 
crosstalk, are solved, the theoretical sensitivity 
and response speed of the single-chip SQUID 
can be obtained. 

Recently, the digital SQUID sensor has 
been operated with an external feedback loop, 
a room temperature digital counter and a D/A 
converter, to measure biomagnetism™ The 
magnetic field produced by the heart has been 
measured with sufficient signal-to-noise ratio™) , 
These results show that the digital SQUID can 
be effectively used for biomagnetic purposes. 


5.5 Future prospects 

Figure 37 shows a future target of the multi- 
channel SQUID system using the single-chip 
SQUID. As the feedback loop is included in 
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Fig. 37—A future target of the multichannel system. 


each chip, there is no need for an external 
feedback loop between the cryogenic environ- 
ment and room temperature electronics. This 
helps suppress external noise, and suppress 
the possible crosstalk between the cables of 
each channel. Also, as the output is a pulse 
sequence, unlike the conventional analog 
SQUID, the signal-to-noise ratio is enhanced. 
The use of the Josephson logic gate or buffer 
gate on the output terminal is_ particularly 
useful in preventing external noise from coming 
from the output cable. 

The major advantage of the _ internal 
feedback circuit is the reduction in the number 
of cables required. The single-chip SQUID 
requires only a bias line and an output line. 
When n channels are operated in parallel, the 
required number of cables is only 2n whereas 
the analog dc SQUID requires 6n. 

Processing with Josephson digital circuits 
will be possible, as the multiplexer and the 
adaptive modulation seem to be promising 
candidates. There is also the possibility of 
reducing the number of cables to less than 2n. 
If we use the Josephson multiplexer, the number 
of output cables can be decreased. If all of the 
SQUID output is multiplexed on a single output 
cable, then the total number of cables will be 
n+l. If the bias can be adjusted by some 
method or circuit in the cryogenic environment, 
one bias line might be all that is needed. Then, 
the number of cables would be only one bias 
line and one output line, except for some multi- 
plexer control lines. 
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6. Conclusion 

This paper reviews the digital SQUID. Unlike 
the conventional analog SQUID, the digital 
SQUID produces a pulse sequence. Combining 
the new superconducting feedback circuit, 
the digital SQUID sensor and the feedback 
loop are integrated on a single chip. This reduces 
the number of cables connecting the room 
temperature electronics and the cryogenic chip. 
It enables us to construct a multichannel system 
with more than 100 channels for measuring 
a biomagnetic field map. Further, it will be 
possible to combine Josephson digital circuits 
and have the data processing of the SQUID 
outputs in the cryogenic environment. Although 
the chip performance has been improved, some 
problems still remain. But the experimental 
data up to now shows that the single-chip 
SQUID is a promising device for constructing 
a multichannel system. 
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Monolithic GalnAs twin PIN photodiodes were fabricated to realize excellent dual-detector 
balanced optical receivers in coherent optical communication systems. Introducing a back- 
illuminated, flip-chip structure gives these photodiodes a small junction capacitance of 
80 fF, a quantum efficiency of 75 percent at a wavelength of 1.54 um, and a cutoff fre- 
quency better than 15 GHz. An optical input power level as high as 8 mW is obtained. 
A large fiber alignment tolerance of 60 um is achieved by integrating InP microlenses. 
The common-mode rejection ratio is better than —30 dB at a frequency of up to 10 GHz. 
The performance demonstrated is well suited to high-speed optical coherent communication 


systems. 


1. Introduction 

Coherent optical transmission techniques 
are very important for lightwave communication 
systems because they significantly improve 
the receiver sensitivity and enable very high- 
density wavelength division multiplexing trans- 
mission. The key to this technique is coherent 
optical reception by optical heterodyne detec- 
tion using a local oscillator and a photodiode. 
Since the sensitivity increases with local oscil- 
lator power, photodiodes must also be capable 
of accepting high optical power. 

Achieving an operating speed of several 
Gbit/s requires the following: New optical 
sources such as semiconductor DFB lasers 
having high optical power (>10mW) and 
a very narrow spectrum (< 1 MHz) for the local 
oscillator, and new photodiodes having a high- 
speed response (>10GHz) at high optical 
power injection (>5 mW). 

In designing highly sensitive coherent optical 
receivers, care must be taken when using a local 
oscillator laser because optical power fluctuation 
of the laser becomes a noise source for the 
receiver. 


84 


To suppress this laser intensity noise, dual- 
detector balanced optical receivers (DBORs) 
are often used)”, Key components of the 
DBOR are two photodiodes and a 3-dB direc- 
tional optical coupler. If the photocurrents in 
the two photodiodes are exactly equal, the noise 
due to laser intensity fluctuation is canceled 
by subtracting the two photocurrents. 

For this reason, it is required that the 
quantum efficiency, capacitance, and frequency 
response of the two photodiodes be well match- 
ed. 

Two photodiodes acquiring by monolithic 
integration; twin PIN photodiodes should be 
very convenient for achieving this requirement. 
The authors have already reported the first 
monolithic twin PIN photodiodes integrating 
two | surface-illuminated photodiodes”, The 
authors have also demonstrated that monolithic 
integration is very effective for achieving well 
matched characteristics. 

On the other hand, several authors have 
reported the fabrication of monolithic devices 
integrating an optical waveguide coupler, a 
semiconductor laser and photodiodes for 
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coherent transmission systems®~®, Although 
the use of such monolithic devices is desirable 
in the future, it is very important to obtain 
key components which can be used now to 
construct high-performance DBORs operating 
at several Gbit/s. 

Previous twin PIN photodiodes” , however, 
are insufficient for high-speed response, high 
optical power detection and large fiber align- 
ment tolerance. And these characteristics are 
critical for constructing the practical key 
components for DBORs. 

In this work, we have developed new twin 
PIN photodiodes satisfying all the above- 
mentioned requirements. 


2. Design 
Figure | shows the basic DBOR circuit. To 


obtain high-performance optical receivers having 


PIN photodiode 


Signal 


O 
== Coupler I<. 
Wa $————O Output 
O 


Fig. 1—Dual-detector balanced optical receiver (DBOR). 


Interconnection metal Metal bump 


SI-InP substrate 


sensitivities close to the quantum noise limit, 
the photodiodes must have the following charac- 
teristics: High quantum efficiency, small 
junction capacitance, good optical coupling 
efficiency, and the endurance for high optical 
power injection. The characteristics of the 
two photodiodes must be matched to prevent 
any imbalance that decreases sensitivity. 

To meet the above requirements, the authors 
introduced a monolithic InP microlens”, a back- 
illuminated photodiode structure, and flip-chip 
bonding®) ; 

Figure 2 shows the twin PIN photodiodes”, 
which were integrated on a semi-insulating (SI) 
InP substrate for electrical isolation. 

The light signal is fed into the junction 
from the substrate side (back-illuminated). 
The InP microlens yields a large tolerance to 
optical misalignment and good optical coupling 
efficiency. Introducing metal bumps for flip- 
chip bonding eliminates the need to use bonding 
wire, and hence eliminates degradation due to 
the inductance of the bonding wire. 

To obtain well matched characteristics, a 
symmetrical layout was used for the _inter- 
connecting metal. The circular bump is for 
the electrical signal output and the square 
bumps are for the bias supply. 

To design the junction structure of the twin 
PIN photodiodes precisely, the cutoff frequency 
and quantum efficiency of the back-illuminated 


P-metal p-InP 


N-metal 


Microlens 


Fig. 2—Twin PIN photodiodes. 
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Fig. 3—Cutoff frequency and quantum efficiency as 
a function of GalInAs thickness. d is the PIN 
junction diameter, @ is the absorption coefficient 
of InGaAs at a 1.55-um wavelength, € is the 
dielectric constant of GalnAs, Vy is the electron 
velocity, and Vp is the hole velocity. 


twin PIN photodiodes was calculated as a 
function of the photoabsorption layer (GaInAs) 
thickness by solving continuity equations)” 
(see Fig. 3). 

The following parameters''? were assumed for 
GalnAs: absorption coefficient, wa = 0.68 (1/um) 
(at wavelength X=1.55 =m); dielectric con- 
stant, €=14.1; electron saturation velocity, 
Vsn = 6.5 x 10* (m/s); and hole © saturation 
velocity, Vsp = 4.8 x 10* (m/s). The three solid 
curved lines show the cutoff frequencies. 
Parameter d is the diameter of the PIN junction 
and was calculated from 10 um to 40 um. 

The left side of the peak of the three curves 
is mainly subjected to an RC time constant 
consisting of the photodiode junction capaci- 
tance | see Equation (Al) of Appendix | and its 
50-Q. load resistance. 

The right side of the peak of the three curves 
is mainly subjected to the carrier transit time 


) 
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Fig. 4—Relationship between focal length (f), lens aper- 
ture (®), radius of curvature (R), and thickness 
(t) of the microlens. 


between electrodes, determined by the electron 
and hole velocity. 

The shaped area shows quantum efficiency. 
The lower boundary is determined by the single 
optical path in the thickness of GaInAs. The 
upper boundary is determined by considering 
the complete reflection of incident light at the 
P-metal surface. This shaded area is only 
achieved by a_ back-illuminated photodiodes 
structure in which a high quantum efficiency 
can be obtained with thinner GaInAs. 

On the other hand, to make the photodiode 
able to operate at high speed under high optical 
power injection, it is important to reduce 
the GaInAs thickness so that the space charge 
effect!®!”) is suppressed. Space charges can be 
accumulated in the photo-absorption layer at 
high optical power injection. This decreases 
the internal electric field and therefore slows 
down carrier velocity. 

Taking this effect into account, the GalnAs 
thickness was designed to be 1.4 um for a junc- 
tion diameter of 20 um. A cutoff frequency 
of 16 GHz and a quantum efficiency in a range 
from 62 percent to 84 percent is expected. 

To obtain a large fiber alignment tolerance, 
the InP microlens was designed using simple ray 
optics theory’), Given the focal length (f) and 
refractive index (”f,p) of the InP microlens, 
the radius of curvature (R) can be found from 
R=f("mp — 1)/nMtnp. The lens aperture is given 
by ®=2/(2Rt —t?), where ¢ is the micro- 
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lens thickness. 

Figure 4 shows the results for a refractive 
index of 3.2. 

The focal length corresponds to the thick- 
ness of the InP substrate. To get a fiber align- 
ment tolerance of better than 50 wm, which is 
required for stable optical coupling, the lens 
aperture was set to 80 um and the microlens 
thickness was 9 um. The substrate was about 
130 um thick. Taking into account the 20-um 
PIN junction diameter, the substrate thickness 
can vary by +25 um and the lens thickness can 
vary by +2 um. 


3. Twin PIN photodiode fabrication 

In fabricating the photodiodes, the authors 
started by selective Zn diffusion to form the 
20-um-diameter PIN junction. Au/AuZn 
p-contacts are formed by evaporation and 
alloying, then the slanted mesa structure is 
made by Ar ion beam etching. 

The authors formed Au/AuGe n-contacts 
on the mesa slope, followed by formation 
of the interconnection metal by evaporating 
Au/Ti on SiN film. AuSn/Pt/Ti metal bumps 
were formed after passivation film deposition. 
The substrate was then thinned, and two 
80-um lenses were fabricated'® as shown in 
Fig. 5S. 

Resist patterning and deformation by 
heating were performed, followed by Ar-ion 
beam etching, while the wafer was rotated. 
The purpose of the last step was to deposit 
a SiN antireflection film using plasma CVD. 

The photodiode layer consists of an n-InP 
(n=1x 1016 cm~3) top layer 1 yum_ thick, 
an n-GalInAs (n<1 x 10!5 cm~3) absorption 
layer 1.4 wm thick, and an n*-InP (n= 2 x 10!” 
cm7?) contact layer 1.5 um thick grown on 
a semi-insulating (SI) InP substrate with a (100) 
plane. 

Figure 6 shows the twin PIN photodiodes 
flip-chip-bonded directly to Au bonding pads 
formed on a ceramic mount 0.85 x 1.3 mm?. 
The center-to-center separation of the two 
microlens was 125uyum. The chip’ was 
300 x 250 um. 
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Fig. 6—Twin PIN photodiodes flip-chip-bonded to 
a ceramic substrate. 


4. Characteristics 

Figure 7 shows the photosensitivity profile 
measured by scanning a single-mode fiber having 
a tapered hemispherical lens at the end of the 
fiber (radius of lens curvature: about 20 um). 

The fiber alignment tolerance defined by 
permitting 0.5-dB coupling loss was as large 
as 60um, despite the small PIN junction 
diameter of 20 wm. At the center of this profile, 
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Fig. 7—Photosensitivity profile. 
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Fig. 8—Photodiode capacitance versus bias voltage 
measured before flip-chip bonding. 


the quantum efficiency exceeded 75 percent 
at a 1.54-um wavelength. The mismatch between 
the quantum efficiencies of the photodiodes 
was within 1.4 percent. 

Figure 8 shows the junction capacitances 
as a function of the applied voltage, measured 
before flip-chip bonding. 

The values of 80 fF and 78 fF at —10 V 
are matched to within +3%. With these closely 
matched, the small capacitances are due to 
the symmetrical layout and the extremely 
small PIN junction area. The value of 80 fF is 
thought to be due to the intrinsic PIN junction 
capacitance (60 fF) and the stray capacitance 
(20 fF) between the interconnection metal and 
the n*-InP contact layer. 

Figure 9 shows dark current (J/g) as a 
function of the applied voltage, measured 
after flip-chip bonding. 

The dark currents around 10 V were 6 nA 
and 1.5 nA for photodiodes A and B (labeled 
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Fig. 9—Dark current versus bias voltage measured after 
flip-chip bonding. 
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Fig. 10—Frequency response characteristics monitored 
for an optical input signal having a wavelength 
of 1.54 um, 


in the inset). The slightly larger current for 
photodiode A is assumed to be due to substrate 
leakage. This imbalance presents no problem 
in practical use because it has minimal effect 
in higher bit rate coherent systems that exceed 
1 Gbit/s. 

Figure 10 shows the frequency responses 
measured using the optical heterodyne 
technique. 

The optical power fed into one photodiode 
was about 350 uW at a wavelength of 1.54 um 
and at applied voltages of +10 V. The load 
resistance of the twin PIN photodiodes was 
50 8. An _ excellent balance was achieved 
between the two photodiodes’ responses. The 
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Fig. 11—Common-mode rejection ratio (CMRR) as 
a function of frequency. 


cutoff frequency was above 15 GHz. The 
measured cutoff frequency is almost equal to 
the calculated value of 16 GHz (see Fig. 3). 

Many chips showed the same frequency 
responses, indicating excellent reproducibility. 
This shows the importance of flip-chip bonding 
in fabricating high-speed photodetectors. 

To confirm the suppression of excess 
intensity noise generated by the local oscillator, 
the common-mode rejection ratio (CMRR) 
was measured using a conventional optical 
fiber coupler. 

Figure 11 shows the CMRR_ measured 
using an intensity-modulated DFB laser having 
a wavelength of 1.54 um. Optical power of 
about 650 uW was fed into one port of the 
fiber coupler. Although data was limited by 
the noise in the measurement system, a CMRR 
of better than —30dB was achieved at up to 
10 GHz. 

In a previous paper”, the authors reported 
a CMRR of —30dB at up to 7 GHz and a 
cutoff frequency of 13.5 GHz using similar 
twin PIN photodiodes bonded to a ceramic 
mount. However, this mount did not have 
a symmetrical metallization pattern layout. 

This shows that the mount for flip-chip 
bonding must also have a symmetrical layout 
to improve the CMRR. 

Figure 12 shows the cutoff frequencies 
as a function of photocurrent for one photo- 
diode measured with a semiconductor laser 
having a wavelength of 1.3 wm. The cutoff 
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Fig. 12—Cutoff frequency as a function of photocurrent 
for optical input having a wavelength of 1.3 um. 


frequency was about 12 GHz at photocurrents 
greater than 8mA under a 10-V bias. The 
maximum injection power (Pj,) can be 
calculated by Pin =(hv/e)xIp, where h is 
Plank’s constant, v is the frequency of the injec- 
tion light, and /, is the photocurrent of the 
photodiode. From this equation, the photo- 
current of 8 mA corresponds to the detected 
optical power of about 8 mW. 

This high optical power detection was 
achieved by introducing the back-illuminated 
structure and optimizing the photo-absorption 
layer thickness. In the photo-absorption layer, 
a high quantum efficiency and suppression of 
the space charge effect are simultaneously 
achieved. 


5. Conclusion 

The authors designed and fabricated flip- 
chip, twin PIN photodiodes which can be 
used to construct high-performance coherent 
transmission systems. 

The two photodiodes were integrated 
with a back-illuminated, lens structure having 
symmetrical layout. The twin PIN photodiodes 
satisfied the requirements of high-speed 
response, high optical power detection and 
a large fiber alignment tolerance. 

The cutoff frequencies were better than 
15 GHz. The optical power acceptability was 
more than 8 mW as measured at a wavelength 
of 1.3 um. The fiber alignment tolerance was 
as large as 60 um, despite the small PIN junction 
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diameter of 20 um. The quantum efficiencies 
and junction capacitances of the photodiodes 
were 75 percent at a wavelength of 1.54 um 
and 80 fF for one photodiode. 

These characteristics were closely matched 
between two integrated photodiodes. 

A common-mode rejection ratio of better 
than —30dB at up to 10 GHz was achieved. 
These results demonstrate the importance of 
the present flip-chip, back-illuminated structure 
with symmetrical layout. This device is well 
suited to applications in high-performance 
coherent DBORs. 


6. Appendix 

The single-junction capacitance (Cpyy) of 
the twin PIN photodiode, considering the 
fringing field, is given by 


a € €, 1d? acu ls (a4 ‘. rT ' 
a SS - T —— — 
a ae ee 2W 
; 27 €€gdT in( a2"5 (Al) 
= n —), 
W ii 


Where e is the dielectric constant of GalnAs, 
€) is 8.854x107!2 F/m, d is the diameter 
of the PIN junction, W is the thickness of 
GaInAs, and TJ is the thickness of the p-type 
region. 
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In 1988, the Phase | standard of Synchronous Optical Network (SONET) was established 
by ANSI. Standards committees are planning for Phase Il, which defines the functions for 
maintenance and operations, and which will be generally standardized in late 1990. 

Fujitsu produced the Fiber Loop Multiplexer (FLM) series conforming to the Phase | 
standard for the Regional Bell Operating Companies (RBOCs) before its competitors. The 


FLM was well received. 


This paper describes the SONET systems created by Fujitsu, Fujitsu’s plan to upgrade to 
Phase II, and the key technologies used in the product. 


1. Introduction 

The development of Broadband Integrated 
Services Digital Network (BISDN) is increasing 
the pace of network reconstruction all over 
the world. 

In. North America, the multiplexing hier- 
archy of DS1 (1.544 Mbit/s), DS2 (6.312 Mbit/s), 
and DS3 (44.736 Mbit/s) has been standardized. 
However, multiplexing hierarchies exceeding 
DS3 are generally proprietary. Fujitsu has 
designed and manufactured optical trans- 
mission system products having optical rates of 
135 Mbit/s”, 405 Mbit/s”, 810 Mbit/s? and 
1.8 Gbit/s. Fujitsu has placed these products 
on the market. 

The ANSITI1 committee has _ initiated 
an effort for standardizing a multiplexing 
hierarchy exceeding DS3. The ANSI T1 com- 
mittee includes representatives from manu- 
facturing, Local Exchange Carrier (LEC) and 
Interexchange Carrier (IXC) companies. In 
1988, it established the ANSI T1.105 standard 
under the name of “SONET’”® coordinated 
closely with the international standards body, 
CCITT. This standard, called ‘“SSONET Phase I’, 
specifies functions which allow multivendor, 
mid-span, optical interconnects which transport 
traffic, or in SONET terms, the payload. 
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In addition, in February, 1990, the first 
draft of the SONET Phase II specifications 
were completed. The specifications provide 
for mid-span meets of Data Communication 
Channel (DCC) protocols, new mappings and 
Phase I clarification. Standards are scheduled 
to be submitted for approval this year. 

This paper describes Fujitsu’s approach to 
provide a total SONET transport system. It 
covers the key technologies of Fujitsu’s new 
optical transmission systems with Optical 
Carrier Levels OC-1 (51.84 Mbit/s), OC-3 
(155.52 Mbit/s), OC-12 (622.08 Mbit/s) and 
OC-48 (2 488.32 Mbit/s) of the SONET multi- 
plexing hierarchy. 

This paper also outlines the following 
Fujitsu SONET products: The FLM 50/150 
with OC-1 and OC-3 optical line interfaces and 
the FLM 600 with an OC-12 optical line inter- 
face already in production. 


2. System outline 

The Fujitsu SONET FLM optical digital 
multiplexer systems conform to the SONET 
hierarchy for North America. These systems 
operate at the OC-1, OC-3, OC-12, and OC-48 
rates and can be used to form a SONET net- 
work. 
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Fig. 1—System application for SONET, 


2.1 System application 

Most applications for Fujitsu’s asynchronous 
optical transmission equipment were previously 
in point-to-point networks. For long-distance 
trunk lines, optical repeating equipment was 
placed between the terminals. Lines were 
extended, dropped, and inserted by back-to- 
back terminal installations using hard wiring. 

An expanding network has increasing 
need for such services as Broadband ISDN, 
ring applications by the Add/Drop Multiplexers 
(ADMs), and _ digital cross-connect systems 
(DCSs). These features provide high quality, 
network survivability, and self-healing attributes 
in addition to traditional transmission capa- 
bility”. 

To cope with the various and increasing 
services required by customers, the operating 
companies need to have a network which can 
be flexibly configured remotely according to 
the customers’ requirements. This is the reason 
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why a synchronous multiplexing format, such as 
SONET, was developed. 

The mid-span meet capability was developed 
to allow multi-vendor networks to be created 
which include the signal transport and 
Operation, Administration, Maintenance and 
Provisioning (OAM&P) functions required from 
the operating companies” ’”, 

With this background, the ANSI Tl com- 
mittee and CCITT are accelerating the standard- 
ization of SONET which enables mid-span 
meet. 

Fujitsu is developing the FLM series SONET 
products to provide for flexible implementations 
of the requirements described above. Figure 1 
is an example of the SONET network applica- 
tion that the FLM series can offer. 


2.2 SONET FLM series 
Fujitsu’s FLM series was introduced to 
support the total SONET system approach 
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Table 1. System parameters of FLM series 


item FLM 50 


FLM 150 FLM 600 


Line rate 51 84 Mbit/s (OC-1 or STS-1) 


155,52 Mbit/s (OC-3 or STS-3*) 


622.08 Mbit/s (OC-12) 


Transmission capacity 672 telephone channels 


2016 telephone channels 


8 064 telephone channels 


Multiplexing 


Positive/negative synchronization, byte interleaving 


ata or : : 
Tributary DS1 x 28 (2-system) DS3 x 3 STS-3* x4 
STS-1* x3 4s 
: 7 OC-3* x 4 
VT 1.5 asynchronous mapping floating DS3 direct mapping 
Mapping VT 1.5 bit synchronous mapping floating STS-1 byte interleave 
DS3 direct mapping multiplexer 
Line code Scrambled-NRZ 7 
Light wavelength 1.31 um 2 1.31 wm or 1.55 pm 
. 1.31 um: FP LD 1.31 um: FP LD or DFB 
Ophieal saures Send 1.55 pm: DFB 1.55 um: DFB 
‘ ‘i 1.31 um: Ge-APD 
Optical detector Ge-APD 1.55 um: InGaAs-APD 


Henieeiion Low-speed: 1:7 


High-speed: 1 + 1, power duplication 


Low-speed: 1:4 
High-speed: 1 + 1 
Power duplication 


Switching scheme 


Forced, auto, manual (Lock-in, lock-out, features) 


Switching time 


Less than 60 ms (BER exceeds 10°? at line rate) 


Supervisory 


TBOS, Parallel**, NMA (TL-1 interface), FJT proprietary interface*** 


Orderwire 


2-wire** or 4-wire 


External environmental 


alarm (input/output) Alarm: 8 items, 


Status: 4 items, Control: 4 items 


Craftperson interface 
Performance monitor 
Physical inventory 


ASCII terminal (1 200-baud) 
Low-speed/high-speed bit error, OOF counts 
Plug-in unit data 


Control Loopback, switching 
Alarm/status Summary/detailed alarm, status monitor 
Provisioning Provision of all-plug-in units 
en External reference input: 2 inputs (primary & secondary) 
Synchronieuton External reference output: 2 outputs (primary & secondary) 
*: Future option **: It is provided for FLM 50/150 only. 


***: This interface is used for connection to Fujitsu SV 
system (FAMS: Fujitsu Alarm & Maintenance 
System) and is provided for FLM 50/150 only. 


in the North America transmission market. 
As first generation products, Fujitsu is supplying 
the FLM 50/150, FLM 600 and FLM 2400. 

Table 1 lists the system parameters of the 
FLM series and Fig 2 shows the system hier- 
archy. As shown in Fig. 2, entire SONET hier- 
archies can be constructed from combinations 
of the three types of FLM systems. The follow- 
ing shows an overview of each FLM configura- 
tion. 

2.2.1 FLM 50/150 

The FLM 50/150 is an optical transmission 


OC-48 


OC-48 (2 488.32 Mbit/s) 


OC-12C (622.08 Mbit/s) 
OC-3 x 16 


or 


OC-3C_x 16 OC-3 


srs-ae x16} | |srs-ae xa} | [srs-3 J 


I ead gs 
axel |ssecal| brs | 


p $ ty 
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STS-1E x 3 
DS3 x 12 


OC-3 (155.52 Mbit/s) 
OC-3C (155.52 Mbit/s) 
STS-3E (155.52 Mbit/s) 


OC-1 (51.84 Mbit/s) 
STS-1E (51.84 Mbit/s) 


DS3 (44.736 Mbit/s) 


DSI (1.544 Mbit/s) 
Fig. 2—Hierarchy for SONET FLM series. 
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system used primarily in the loop network. 
The product features an integrated floating 
virtual tributary (VT) or M13 multiplexer 
permitting flexible networks. Other features 
include an operating temperature range from 
=—40°C to +65°C, compact packaging, and 
reduced power drain for installation in a cabinet. 

The main components the FLM 50/150 are 
as follows: 
1) Terminal 
2) ADM-Linear 
3) ADM-Ring 
4) Fiber Hubbing (50 Mbit/s Fiber Extension) 
5) M13/STS-1 Multiplexer. 

2.2.2 FLM 600 

The FLM 600 is a 600 Mbit/s optical trans- 
mission system used primarily in the interoffice 
or trunk network. This equipment can be used 
to transport asynchronous DS3 signals in the 
same manner as conventional Fudtsu asychro- 
nous optical systems, such as the 405/810 Mbit/s 
line terminal equipment. This enables the 


X.25 
Packet switch network 


Packet 
switch 


TL-1 
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operating companies to immediately deploy 
the FLM600 with the capability for future 
expansion. The FML 600 can be upgraded to 
the FLM 2400 without affecting traffic. 

The main components of the FLM 600 are 
as follows: 

1) Terminal 

2) ADM-Linear 

3) ADM-Ring 

4) Fiber Hubbing (150 Mbit/s Fiber Extension), 
2.2.3 FLM 2400 

In the trunking system, ultra high-speed 
optical systems with advanced network surviy- 
ability technologies should be introduced. 

The FLM 2400 is a 2.4-Gbit/s optical 
transmission system to be used in the next 
generation of trunk network. The FLM 2400 
system will be supplied with a 1:n protection 
switching capability. 

As described in Subsect. 2.2.2 the Flm 2400 
can be constructed by upgrading the FLM 600 
or is available as a stand-alone FLM 2400 LTE. 


NMA: Network Monitoring 
& Analysis 


Fig. 3—Current operating system. 
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3. Operations support 
3.1 Current operational support systems 

The requirements for supporting SONET 
have not been completely standardized and 
are under discussion at Tl and CCITT. Because 
the industry is in transition from earlier asychro- 
nous optical systems to SONET systems, we 
must consider connecting SONET equipment 
to existing operations systems (OSs) which 
support asynchronous systems. 

Accordingly, this section describes current 
practices for connection to the network 
monitoring & analysis (NMA) OS deployed 
by the RBOCs in North America. NMA is used 
to record and retrieve information on alarms 
and the status and performance data of network 
elements (NEs). 

It is difficult for Fujitsu to undertake 
the primary development of the interface for 
connection to NMA alone, Fujitsu must work 
on this stage of development with its customers 
and Bell Communications Research (Bellcore). 

The block diagram in Fig. 3 shows the con- 
nection from NMA to the FLM series being 
developed by Fujitsu. Methods of making 


(OSI protocol) 
CMISE 


ISO 8073 AD2 


ISO 8473/X.25 


LCN 


(OSI protocol) 
CMISE 
X.226 


ISO 8073 AD2 
ISO 8473 
IEEE 802.2 


IEEE 802.3 


(OSI protocol) 


such connections are now being discussed 
with the RBOCs and Bellcore. 


3.2 SONET operating systems/SONET Phase I 
upgrade 

The SONET Phase! standard established 
the necessary specifications to ensure a traffic 
mid-span meet. By this we mean that it is 
possible to transmit traffic (DSls, DS3, etc.) 
from a network element from one manufacturer 
to another. This is analogous to the situation 
of building a network which contains M13 
multiplexers from different suppliers. DSIs 
are reliably multiplexed and demultiplexed 
but many of the desired operation support 
features are not standardized and therefore 
not available. 

SONET Phase II was to finish the specifica- 
tions of the protocols and messages used to 
communicate OAM&P information over the 
SONET Phase II interface. Figure 4 illustrates 
the protocols and where they exist in the 
network. The DCC refers to the embedded 
operations channel (EOC) in the SONET inter- 
face (the NE-to-NE link). The local communica- 


(OSI protocol) 


°C : Data Communication Channel 
DCN: Data Communication Network 
LCN: Local Communication Network 
OS: Operations System 
OSI : Open System Interconnect 


Fig. 4-SONET operating system. 
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tions network (LCN) refers to the local area 
network interface used to communicate between 
the NE and the OSs. As shown in the figure, 
a mediation device can be used to convert 
the local area network protocol used in the 
LCN to the protocol required by the OS. 
Notice also that all of the upper layers (layers 
4 to 7) are the same for all protocol stacks. 
SONET was released in these phases because 
one is able to migrate from Phases I to II and 
beyond by replacing the read only memories 
(ROMs) or by downloading software. 

Unfortunately, the Tl committee did not 
complete the specifications of the CMISE 
messages to be used over the DCC in the Phase II 
specifications. However, an agreement was 
reached on the protocol stacks. It is anticipated 
that the first set of messages will be available 
in the first half of 1991. This will allow suppliers 
to implement the necessary protocols and 
update the messages when appropriate. 

For the next several years, SONET inter- 
faces will most likely communicate using 
transaction language 1 (TL-1) messages. This 
is because many of the customers’ OSs only 
accept TL-1 and will not be in a position to 
accept CMISE messages for several years. Trans- 
lation from abstract syntax notation | (ASN.1) 
to TL-1 is not practical in the smaller network 


omy ae 
PF] sts. wy 
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elements which implement many of the SONET 
interfaces. Therefore development of ASN.1, 
even though it may be standardized, will be 
outpaced by the development of OSs which 
can accept the new message format. 


4. Equipment design 
4.1 FLM 50/150 

This equipment multiplexes the North 
America DS1 and DS3 signals into the synchro- 
nous transport signal level | or level 3 (STS-1 or 
STS-3 shown in Fig. 5) of the SONET frame 
format, for transmission at the optical level 
of OC-1 or OC-3. 

Figure 6 is a block diagram of the FLM 150. 


3 x 90 bytes 
[es nytes— : 


STS-3 envelope capacity 


Transport 


overhead VTLS5 x 84 


DS3 x 3 
with STS path overhead x 3 


———7 rows 


Signal length (ys) 


Fig. 5—STS-3 (155.52 Mbit/s) frame structure. 
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Fig. 6—Block diagram of FLM 150. 
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VT1. 5 VT SPE 
(1.728 Mbit/s) 


VT byte 


V,: VT pointer-1 
VT byte V,: VT pointer-2 
‘ V,: VT pointer-3 
(action) 
V,: VT reserved 
V;: VT path 


overhead (POH) 


VT byte R : Fixed stuff 


500 us 500 ys 


Fig. 8—VT1.5 superframe structure. 


Figure 7 shows the front view. Power consump- 
tion is 72 watts when 84 DS\Is are provided. 
1) Mapping and pointer action 

The FLM 50/150 uses the floating VT 
and DS3 asynchronous mapping to support 
the following payloads: 

1) DS1 asynchronous 
2) DSI bit-synchronous 
3) DS3 asynchronous. 

Figure 8 shows the VT1.5_ superframe 
structure. The FLM 50/150 has the following 
SONET characteristics: Enabling a _ positive 
bit stuffing operation to absorb the offset 
of input frequency in asynchronous signals, 
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1.0 


Output jitter (UI,_,) 


—60 -40 —20 0 20 40 60 
Bit rate offset x (ppm) 


12 9 6 3 0 3 6 9 12 
Pointer action cycle y (Hz) 


Fig. 9—DS1 bit-synchronous output jitter. 


Table 2. Output jitter and DPLL parameters 


Se DPLL design 
—| 
Sampling frequency (MHz) Pro nare 
Buffer stages 72 
DPLL counter value 256 
Output jitter (UIp-p) 1.05 


and pointer action operations to absorb the 
fluctuation of synchronous timing. 

This timing fluctuation is due to the jitter/ 
wander from the building integrated timing 
supply (BITS) or external synchronous circuits 
in the NEs, and also plesiochronous operation 
spanning different synchronous islands. 

A gap exists with a maximum of 3 bytes 
in a cycle of 125 us when extracting the DS] 
signals from a VT format. If a pointer action 
occurs, one more byte gap is added. The byte 
gap at a pointer action, which occurs in a slow 
cycle, affects the output jitter of DSls from 
the equipment. 

Figure 9 shows the DSI output jitter with 
VT pointer actions occurring in a bit-synchro- 
nous mapping. 

A digital phase-locked loop (DPLL)® with 
low cut-off frequency characteristics was 
utilized to achieve these results. Table 2 shows 
the output jitter and DPLL parameters. 

This jitter was produced by changing the 
input bit rate in the bit-synchronous floating 
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Fig. 1O—DS1s channel unit (LSCH-D1), 


mode and causing VT pointer actions. 

Because there is a maximum of 500 (2-ms 
cycle) pointer actions per second, the average 
frequency of the VT pointer action y when 
the DS1 frequency is dislocated by x ppm is 

p= OWl93' x (Hz); 

According to the Bellcore specifications, 
the output jitter shall be no more than 1.5 UI 
peak-to-peak”. The above-mentioned DPLL 


satisfies this requirement. 
2) Minimization of equipment 
To minimize the size of the equipment, 
Fujitsu developed the following: 
i) Large scale integration (LSI) circuits for 
the mapping functions (five 12 000-gate 
CMOS ASICs, one ECL ASIC) 
ii) Modular  optical/electrical 
(O/E, E/O) 
iii) Surface mount technology (SMT) DS1 
channel unit. 
Since the sub-STS signals are transmitted via 
VT groups (VTGs) at 6.912 Mbit/s (equivalent 
to 4DSIs) when multiplexing DSI signals into 
the SONET frame, four channels were mounted 
on one DS| tributary unit for minimum size. 
Two LSIs (12 000 gates each and packaged 
in a Quad Flat Package) are used to multiplex/ 
demultiplex four DS1ls into a VT group. Func- 
tions included in this LSI are mapping the 
DSIs into floating VT1.5s and multiplexing 
the VT1.5s into a VT group. Figure 10 is a 
photo of the channel unit. 


converter 
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Top of 
the frame 


a) Input b) Output 


1): Insert the pointer value indicating the timing of input J1. 
2): Send data without delay after receiving it. 


Fig. 11—STS pointer processing (normal operation). 


3) Hubbing 

Hubbing is an application where one STS-1 
SPE from an OC-3 is transmitted to an OC-1 
without terminating the path as shown in 
Fig. 1, 

When an STS-1 in the incoming OC-3 signal 
is taken out and placed into the OC-1 signal to 
be sent to a remote station, the data delay must 
be minimized for all frequency offsets (e.g. 
offset of input frequency and output frequency) 
and data slips must not occur. This equipment 
minimizes the data delay and eliminates slips 
through the use of STS pointers and enables 
all frequency offsets to be absorbed. 

The left side of Fig. 11 shows the STS-1 
frame format of the input optical signal. The 
right side shows the frame format of the output 
optical signal. The frame moves from f, to f2. 
Jl in the figure is the first byte of the STS 
synchronous payload envelope (STS SPE). 

To send an STS SPE without delay, the J1 
byte (top of STS SPE) is mapped from the 
frame of the input optical signal to the frame 
of the output optical signal simultaneously. 
At the same time, a new STS pointer value 
is written to locate the start of the outgoing 
SPE. 

The equipment can cope with any frequency 
offset. To do this, the equipment continually 
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cl al ia 
2{H3| 
ijl 


Positive 
stuff byte 


EB 
E 


b) Output 


a) Input 


@: Receiving pointer value with increment bits inverted. 

@: Detect the phase difference between the input J1 and 
the output frame. 

@: Invert the increment bits (new pointer value). 


Fig. 12—STS pointer processing (with pointer action). 


Monitor 
unit 


Other units 


Common bus 


Switching 
control bus 


seer a —: Alarm and 

Switching f F 
control ; performance data 
unit —: Switching 


request data 


K1, K2 overhead bytes 
from high-speed unit 


Fig. 13—Switching architecture. 


supervises the SPE timing of the input optical 
signal and the frame timing of the optical 
signal to be sent by controlling the pointer 
value. (When the equipment detects a phase 
difference of eight or more bits in the buffer 
memory by using a phase comparator, the 
pointer value is incremented or decremented 
in the output optical signal.) Figure 12 shows 
an example of incrementing the pointer for 
in an incoming optical signal. 

For the pointer to be incremented, the J] 
timing phase differs by eight bits. If the phase 
of the frame of the output optical signal is 
compared with the phase of the transmit J1 


timing and the phase difference is eight bits 
or more, the equipment can increment the 
pointer value at the next frame and absorb 
the frequency offset. 

4) Switching architecture 

Figure 13 shows the optical line switching 
architecture. Each box in the figure is one unit. 
Units are connected with a common bus and 
a control line for switching. The common bus 
is controlled by the micro-processor of the 
monitor unit. The common bus detects a fault 
that requires switching and reports it to the 
switching control unit. It functions as a fault 
monitor, performance monitor, and mainte- 
nance control unit. 

The processor detects and processes the 
cause of switching in cycles of 10 ms or less 
to allow switching within 50 ms. It reports 
other performance data in a slower cycle. 
A multi-task operating system is _ installed 
in the monitor micro-processor and a_ timer- 
interrupt is used independently of the operating 
system to activate 10-ms processing decisions. 

In addition, the micro-processor in the 
switching control unit independently controls 
automatic protection switch (APS) according 
the protocol used over the K1 and K2 overhead 
bytes in conjection with the remote station. 

5) Temperature requirements 

Transmission systems such as_ the 
FL 50/150 when used in the loop area are often 
required to operate in uncontrolled environ- 
ments. In the U.S., the range of expected 
temperatures inside a cabinet, pedestal, or other 
enclosure used to house the equipment, gener- 
ally ranges from —40°C to +65°C!®). To with- 
stand large temperature ranges, the FLM 50/150 
is manufactured by temperature-screening on 
a component basis. 

A 100-cycle temperature test implemented 
by Fujitsu and Bellcore has demonstrated that 
the equipment meets all specifications over 
the larger temperature range. 


4.2 FLM 600 

The FLM 600 equipment converts an OC-12 
signal into DS3, STS-1 or STS-3 tributaries and 
vice versa. 
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OC-12 (W) 
OPT out 


OC-12 (P) 
OPT out 


OC-12 (W) 
OPT in 
OC-12 (P) 
OPT in 


External reference 
input/output 


I Office alarm 
interface interface Orderwire 


Craft OSS 


Fig. 14—Block diagram of FLM 600. 


Figure 14 is a block diagram of the FLM 600 
terminal equipment. MUX/DMUX, the optical 
sending and optical receiving units have a | + 1 
redundancy. The tributary units consist of four 
working units and one protection unit operat- 
ing in a 1:4 redundant structure. The high-speed 
units and the switching system in the tributary 
units are designed to operate independently 
to improve system reliability. 

Most circuits are designed in LSIs. Principal- 
ly eight parallel signals are used to handle the 
SONET frame format in the MUX/DMUX 
unit having 622.08-Mbit/s capacity. Therefore 
the maximum frequency of the parallel lines is 
77.76 Mbit/s to enable stable operation and 
reduce power demand. The 622.08-Mbit/s high- 
speed signals are processed from parallel to serial 
and in reverse in the optical send and recieve 
units. This high-speed conversion is ac- 
complished with state-of-the-art GaAs-ICs, 
which also reduce power demand. 

Figure 15 shows the FLM600 LTE. The 
power requirement is 114 watts when fully 
loaded. Up to four terminals can be mounted 
in a 7-foot rack. No fans are required due to 
the low power consumption of the equipment, 

Fujitsu is planning a flexible system architec- 
ture having various tributary interfaces including 
an upgrade from the FLM 600 to FLM 2400. 
This transmission system, operating at OC-48, 
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Fig. 1S—FLM 600 LTE. 


is also being developed as one of the FLM series. 


4.3 Optical interface 

For the SONET optical interface, it is 
essential that the interface conforms to the 
criteria for mid-span meets and environmental 
conditions at an installation site. 

Minimization, low power consumption, and 
adaptability to the larger temperature ranges 
are required for loop systems with OC-1 or OC-3 
interfaces. In trunking systems with OC-12 or 
OC-48 interfaces, ultra high-speed devices and 
the modulation method are key technologies. 

This section focuses on the OC-1, OC-3, and 
OC-12 interfaces. 
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Table 3. FLM50/150/600 optical path design 


he Se OC level | OC-1.IR oc-3 LR OC-12 IR | OC-12 LR | OC-12 LR 
Item 1.31 um 1.31 um 1.55 um 1.31 um 1.55 um 
Line rate (Mbit/s) | 51.84 155.52 622.08 
Line code | Scrambled-NRZ 
Center wavelength (nm) E 270-1 340 | 1 270-1 340 | 1 285-1 330 | 1 525-1 575 | 1 291-1 333 |1 300-1 320 | 1 525-1575 
: i e | a 
: | 1 (20 dB 1 (20 dB 
Spectral width (nm)|} 7(RMS) | 7(RMS) 4 (RMS) down) 4 (RMS) 2 (RMS) down) 
Optical power (dBm) —15 —15 —2 —4 —15 —3 
Minimum receiving 
jexel (dBm) —30 —30 —39 —30 —31 —32 
=e = _ 
LD type MLM MLM MLM SLM MLM nae SLM 
APD type [ Ge Ge Ge InGaAs Ge Ge InGaAs 
Power penalty (dB) 1 : 1 | 1 
Cable dispersion 
100 100 160 2000 80 110 1 500 
: (ps/nm) | A 5 : [ 
Max span (km) | 15 15 40 100 | 15 40 80 


RMS: Root Mean Square 


4.3.1 Optical path design 

Table 3 gives the optical path design 
parameters conforming to the SONET Phase I 
specifications. 

1) FLM 50/150 optical path design 

At OC-1, using the intermediate reach (IR) 
option, data can be transmitted “over 15 km 
using a 1.31-uwm Fabry-Perot laser diode (FP-LD) 
and a germanimum = avalanche photodiode 
(Ge-APD) detector!?. At OC-3 utilizing FP-LD 
and APD for both the 1.31-um IR option and 
1.31-um long-reach (LR) option, data can be 
transmitted over 15 km with IR or 40 km with 
the LR 1.31 wm. At OC-3 using the 1.55-uwm LR 
option, data can be transmitted over 100 km 
using a distributed feedback laser diode 
(DFB-LD)’” and InGaAs-APD. 

In the 1.31-um systems of OC-1 and OC-3, 
since optical fiber dispersion is small, the 
distance is restricted only by optical fiber loss. 
In the 1.55-um system at OC-3 using a non- 
dispersion shifted fiber whose dispersion per 
kilometer is 20ps/nm, the total system 
dispersion for 100 km is 2 000 ps/nm, Therefore 
the power penalty is set to 1 dB due to the fiber 
dispersion. For this power penalty, the side 
mode suppression ratio of the 1.55-~m DFB-LD 
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MLM: Multi-Longitudinal Mode 


SLM: Single-Longitudinal Mode 


must be 30 dB or more. 
2) FLM 600 optical path design 

With the 1.3l-um IR otpion of OC-12, 
15-km transmission can be achieved with 
1.31-um FP-LD and Ge-APD. The 1.31-um LR 
option of OC-12 provides 40-km transmission 
utilizing a 1.31-um DFB-LD and Ge-APD. 
With the 1.55-um LR option of OC-12, up 
to 80-km transmission is accomplished with 
a 1.55-um DFB-LD and InGaAs-APD, With the 
1.31-um and 1.55-um IR/LR options of OC-12, 
the power penalty is set to 1 dB due to the fiber 
dispersion. 

4.3.2 Optical module design 

In the FLM 50/150/600 systems, optical 
modules are being designed according to the 
fiber span length. The 1.31l-um IR optical 
module at OC-1 and OC-3 levels is designed 
without a Peltier cooling element for tempera- 
ture conditions from —40°C to +65°C to reduce 
the power cinsumption. 

This section describes the minimum-size 
optical modules for LR OC-3 (1.55 um) and for 
LR OC-12 (1.31 wm) used for trunk lines. 

Figure 16 is a block diagram of the elec- 
trical/optical converter (E/O) module and the 
optical/electrical converter (O/E) module. 
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Bistable 
circuit 


density 
detector 


Peak 
detector 4 
erin eS ao | 


b) O/E module 


Fig. 16—OC-1, 3, 12 module block diagram. 


E/O 


Fig. 17—OC-3 1.55 um E/O, O/E module. 


Figure 17 is a photo of the E/O module 
containing the DFB-LD and the O/E module 
used in the OC-3 1.55-um LR option. 

Figure 18 is the E/O and O/E module used 
in the OC-12 1.3 1-um LR option. 

1) OC-1/OC-3 optical transmitter 
To cope with high-density module mounting, 
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= 


Fig. 18—OC-12 1.31 um E/O, O/E module. 
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=30 


Spectrum (dBm) 


1.52 1.545 1.57 
Wavelength (um) 


Fig. 19—OC-3 optical output spectrum. 


the size of each functional block needs to be 
minimized. However, minimizing the LD module 
in the optical send-receive unit was restricted by 
its shape. 

To minimize the transmitter, Fujitsu has 
developed an LD module by applying the 
following technology to the drive circuit: 

i) Integrated circuits (ICs) using Si-bipolar 

processes (Two ICs) 

ii) Reduction of heat resistance by utilizing 

ceramic substrates 

The pulse width of the input data is con- 
trolled by a bistable circuit and duty-cycle ad- 
justment circuit. The LD drive unit drives the 
LD with the required current. The automatic 
power control (APC) circuit stabilizes the optical 
output power. The APC circuit amplifies the 
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Fig. 20—OC-3 optical output waveform. 
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Fig, 21—OC-3 LR (1.55 um) bit error rate characteristics. 


transmitter optical signal detected by the photo- 
diode monitor and stabilizes the optical output 
power by feedback control. This module also 
has an LD current alarm function for detecting 
excessive LD current. 

Figure 19 shows the optical output spectrum 
characteristics of the DFB-LD module using an 
OC-3 1.55-um LR option. 

Figure 20 shows the optical output wave- 
form at OC-3, 1.55-yum LR. 

2) OC-1/OC-3 optical receiver 
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Fig. 22—OC-12 optical output spectrum. 


Each function block consists of the ap- 
propriate Si-bipolar process ICs. In addition, 
the radiation effects due to modules mounted 
on highly reliable miniaturized ceramic substrate 
are also reduced. 

For the optical detector, Ge-APD is used 
for the 1.31-um option and InGaAs-APD is used 
for 1.55 um. The current from the APD is 
amplified to the required level by a preamplifier 
and main amplifier. The timing unit regenerates 
a stable clock by non-linear extraction and 
a surface acoustic wave (SAW) filter. The timing 
unit also provides a low optical signal alarm by 
detecting a decrease in the amplitude of the 
timing signal. The regenerative circuit identifies 
the signal amplified equally and issues the signal 
from the module. 

Figure 21 shows the OC-3 1.55-um LR bit 
error rate characteristics. 

3) OC-12 optical transmitter 

The OC-12 LD module has been minimized 
in the same way as the OC-1 and OC-3, The 
block diagram of the E/O module is the same 
as that of the OC-1 and OC-3. 

Figure 22 shows the optical output spectrum 
of the DFB-LD module in the OC-12 1.31-um 
LR. Figure 23 shows the OC-12 1.31-yum optical 
output waveform, 

4) OC-12 optical receiver 

Each block consists of the appropriate 
Si-bipolar process ICs in the OC-12 module 
in the same way as the OC-1 and OC-3. The 
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Fig. 23—OC-12 optical output waveform. 
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Fig. 24—OC-12 LR (1.31 um) bit error rate 
characteristics. 


block diagram of the O/E module is the same 
as that of the OC-1 and OC-3. 

Figure 24 shows the OC-12 1.3l-um LR 
bit error rate characteristics. 


5. Conclusion 
Fujitsu. has developed and delivered the 
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FLM 50/150 and FLM 600 terminal equipment 
conforming to the SONET Phase I specification 
to RBOCs ahead of its rivals. A Phase II package 
is being developed to upgrade existing Phase I 
products to conform to Phase II standards. 
In addition, the FLM architecture will have 
enhancements to conform with future standards. 
To construct more flexible SONET networks in 
the future, an optical ADM, ring, and integrated 
digital loop carrier (IDLC) must be developed. 
The basic specifications and technology are 
common all over the world. Therefore, Fujitsu 
will provide the international market with 
SONET products, referred to in CCITT as 
synchronous digital hierarchy (SDH). Ac- 
complishing this requires the mass production 
of gigabit optical and LSI technology. It is also 
important to review with the customers the 
operation, control, and maintenance functions, 
including Phase II and beyond. 

SONET systems are developing towards 
the subscriber and inhouse delivery of broad- 
band ISDN. Further study and development 
are required to greatly reduce system costs 
for the economical introduction of these 
advanced services. 

Fujitsu intends to promote a total SONET 
approach in line with its goal of becoming a 
world lead in the field. 
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Conductive Fur Brush Charging of 


a Dielectric Surface 


® Masahiro Wanou @ Masatoshi Kimura 


(Manuscript received September 12, 1990) 


Charging of a dielectric surface without corona discharge has been achieved using a conduc- 


tive fur brush to which an electrical potential -of several hundred volts is applied. In this 
method, it was observed that brush charging is accomplished by direct charge transfer, gas 
discharge, and triboelectric charging. Direct charge transfer, the primary charging process, 
was analyzed using an ohmic contact model. The charged potential is nearly proportional to 


that of the brush. When the charging time exceeds 0.3 seconds, the charged potential on the 
surface of the photoreceptor saturates at almost the potential of the brush. Brush charging 
is, therefore, useful as a low-voltage charging technique. 


1. Introduction 

Corona charging is a well-developed tech- 
nique, and is generally used in electrophoto- 
graphy >”), However, the corona _ charging 
device requires several thousand volts to produce 
corona emission; also, it generates ozone, 
which can damage other devices such as photo- 
receptors. Therefore, a new low voltage charging 
method is required. 

It is known that a conductive brush main- 
tained at several hundred volts and placed in 
contact with a dielectric surface can transfer 
its charge to the dielectric. However, there 
is no detailed report available on brush charging. 
Also, although transfer using a low voltage 
to create a charge pattern on a dielectric layer 
has been studied for electrography®’”. These 
studies investigated the amount of charge 
applied from styli less than 100 wm in diameter. 

Fujitsu has analytically and experimentally 
investigated conductive fur brush charging. 
Brush charging experiments were carried out 
with Mylar film and photoreceptors. 


2. Brush charging model” 

A conductive fur brush maintained at 
several hundred volts and placed in contact 
with a dielectric surface can apply its charge 
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to the dielectric surface. In this method, there 
are three charging processes. These are: direct 
charge transfer due to contact between the 
brush fiber and dielectric surface, charge transfer 
by gas discharge in the small air gap between 
the fiber tips and surface, and triboelectric 
charging. The triboelectric charge depends 
on the surfaces of the two materials that rub 
together” and is less than the charges due to 
the other two processes. Therefore, direct charge 
transfer and gas discharge only were analyzed. 


2.1 Direct charge transfer 

A model and its equivalent circuit for brush 
charging are shown in Figs. | and 2 respectively. 
In the equivalent circuit the conductive brush 
is regarded as a resistor, the dielectric layer 
as a capacitor, and the brush-dielectric interface 
as a capacitor in parallel with a resistor. For 
simplicity, the brush-dielectric interface is 
treated as an ohmic contact. 

During charging, a real charge g, flows 
through the contact region of the brush fiber 
and dielectric and is deposited on the surface 
of the dielectric. Charge g, is induced at the 
small air gap between the fiber tip and dielectric 
surface. Charge qq is the sum of the induced 
charge qc., real charge g,, and initial charge on 
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Fig. 1—Brush charging model. 
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Fig. 2—Equivalent circuit. 


the dielectric surface go). After the dielectric 
leaves the brush, the real charge q, and the intial 
charge go remain on the surface to form the 
surface potential of the dielectric. 

In the equivalent circuit, the following 
equation is derived assuming that the initial 
values of g, and q, are zero. 


Ge = dd — Ge = Gee 
The voltage across the _ brush-dielectric 
interface is as follows: 


da; _ 4 


dt Ce 
where R, is the resistance of the contact region 
and C, is the capacitance of the small air gap. 


Potential V, applied to the brush can be 
represented by: 
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Substituting Equation(1) with Equation (3), 
gives: 
dq; 


dt 


d 
Va = Ro— Ge +42 + Go) + Re ZG 
dt 
Gc +4: +40 
+ —————————— 
Ca 


Also, by substituting Equation (2) with 
Equation (4) and rearranging, the following 
differential equation for q, is obtained: 

dq; dq 


+B 
dt? dt 


A +@; =CaVa— Go, »«»G) 


Where A = C.CaRpRe 5 
B=C.Ro + Gaky + Gak ox 
It is assumed that go is zero prior to 
charging. For further simplification, since Rp is 
low compared to R,, brush resistivity Rp is 
ignored. By solving Equation (5) for gq; and 
substituting Rp =O and qo =O0, the charge 
applied to the dielectric is as follows: 


t 
a= Ca¥a|1~ exp {-=——-} |. 
RAC. + Ca) 


The equation for surface potential V, then 
becomes: 


t 
V; = raf —exp \-sasas} ‘ (7) 
R.(C, + Ca) 


2.2 Charge transfer by gas discharge 

When the brush potential exceeds the break- 
down voltage, a gas discharge takes place in 
the air gap between the brush fiber tips and 
dielectric surface. As the gas discharge proceeds, 
charge accumulates on the dielectric surface, 
reducing the potential across the brush-dielec- 
tric interface. The gas discharge is extinguished 
when the gap potential falls below the minimum 
value required to sustain the discharge. After 
extinction, the direct charge transfer discussed 
in the preceding section takes place. Since the 
gas discharge occurs in a very short time, the 
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charge due to the gas discharge is regarded as 
only an initial charge in the direct charge trans- 
fer process. 

The extinction voltage between the brush 
and the ground plane of the dielectric V., is 


given by the following empirical equation”: 


La 
Vex = V gex +£i,— ; 
€r 


where Vgex is the extinction voltage of the air 
gap between the brush and dielectric surface, 
and £, is the electric field on the dielectric 
surface. The gas discharge is extinguished when 
the potential on the dielectric surface reaches 
Ve = Ve — Vex. Therefore, it is assumed that 
the dielectric has an initial charge gy = CaVge on 
its surface prior to the direct charge transfer. 

By solving the differential Equation (5) for 
qr using the initial charge gg, and by ignoring 
the small Rp, the charge and surface potential 
on the dielectric is as follows: 


t 
qr = Ca E — Vex exp - oom ; 
R.(Cy + Cq) 


3. Experiment 

Figure 3 shows the experimental setup. 
A Mylar film and selenium-coated drum are 
used as electroreceptors. Aluminized Mylar 
films of 25-um, 50-um, and 75-um thickness 
are mounted on a grounded metal drum. 
A conductive fur brush roll or bar brush is 
placed in contact with the surface of the electro- 
receptor. The conductive fur brush roll (32 mm 
diameter) consists of a metal core and conduc- 
tive rayon fiber. The bar brush consists of 
a metal plate and conductive fiber. The values 
of the brush parameter were determined ex- 
perimentally. The brush fiber density is 
155 fibers per square millimeter. The diameter 
of the fibers is 10 wm, and the resistivity is 
10? Q-cm. A voltage of approximately —500 V 
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Fig. 3—Experimental setup. 


to +700 V is applied to the brush. The probe 
of a surface potential meter is positioned over 
the drum. The probe output can be displayed 
visually on an X-Y plotter, or it can be digitized 
and stored for analysis. 

The conductive brush roll was used to 
charge the Mylar film. The brush is maintained 
at a selected constant potential, and transfers 
a charge to the Mylar surface. The brush is 
rotated at 500 rpm to achieve charging with 
good uniformity. The drum is rotated at a 
surface velocity of 190 mm/s. Since the Mylar 
has a high resistivity, there is no charge decay. 
Therefore, a charge accumulates during multiple 
rotations of the drum. The charging time is 
controlled by controlling the number of drum 
rotations. The charge potential on the Mylar 
is measured using surface potential meter. 

The photoreceptor drum was charged by 
the conductive bar brush. The bar brush is 
placed in static contact with the Se-Te drum 
surface. The measurement is made in the same 
way as for the Mylar except that the photo- 
receptor drum measurement is made in the dark. 


4. Results and discussion 
4.1 Triboelectric charge 

Figure 4 shows the relationship between 
the surface potential V, on the Mylar film and 
the charging time when the brush potential is 
fixed at zero. Figure 4 shows that V, decreases 
with charging time. Even though the brush is 
grounded, a charge flows through the brush- 
dielectric interface. This charge flow is caused 
by the triboelectric phenomena. The magnitude 
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of the triboelectric charge is determined by 
the surface states of the materials rubbing 
against each other®. Therefore, the surface 
potential V, increases with the thickness of the 
Mylar film Lq. 


V) 


( 


Surface potential V. 


0 0.5 1.0 
Charging time ¢ (s) 


Fig. 4—Triboelectric charge for Mylar of various 
thickness. 
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Fig. 5—Relationship between V, — V, and charging time 
for direct charge transfer. 
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In the following discussions, the triboelectric 
charge was eliminated from the experimental 
results in order to investigate the charge transfer 
process. 


4.2 Direct charge transfer 
This section discusses the direct charge 
transfer process without gas discharge. By 


rearranging Equation(7), V,—V;, can be 
represented as: 
t 
V, — V, = Vaexp Perera > CD) 
R (Ce o Ca) 


This shows that the logarithm of V, — V, is 
a linear function of the charging time. Figure 5 
shows the relationship between V, — V, and the 
charging time when the parameter is the brush 
potential V,, and the Mylar film thickness 
La is 25 um. Figure 5 shows that the logarithm 
of V, — V, decreases linearly with an increasing 
charging time and that the slopes of these lines 
are the same. The charging time constant of 
Equation (11) can be obtained from the slope 
of these lines and was found to be 380 ms. 
The capacitance of the dielectric Cg can be 
calculated from the Mylar film thickness Lg and 
its relative dielectric constant (e, =3.2). 
Capacitance C, was measured at 0.18 wF/m? in 
the experimental setup. Using these values, 
resistance R, = 2.9 x 105 ohm-m?. 


Mylar film 7,4: 300V 
Ca(uF/m?) 


(V) 


Surface potential V. 


0 0.5 1.0 
Charging time ¢ (s) 


Fig. 6—Surface potential as a function of charging time. 
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(La: 25 pm) 
Ca: 1.13 wF/m? 


0 0.5 1.0 
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Fig. 7—Relationship between V, — V, and charging time 
for gas discharge process. 


Figure 6 shows the surface potential V, as 
a function of the charging time for three differ- 
ent values of dielectric Cg when the brush po- 
tential V, is 300 V. The solid lines indicate the 
results of computation when R, =2.9 X 105; 
these results agree well with the experimental 
results when the charging time is longer than 
0.3 seconds. However, the experimental data 
deviates from the solid line when the charging 
time is shorter than 0.3 seconds. This deviation 
seems to be caused by nonuniformity of R,. R, 
is believed to be influenced by the electric field 
(increasing with a decreasing electric field). 


4.3 Charge transfer by gas discharge 

Figure 7 shows the relationship between 
the absolute value of V, — V, and the charging 
time when the Mylar film is 25 wm_ thick 
(Cq = 1.13 wF/m?). The potential applied to the 
brush is high enough to cause gas discharge. 
The plots for the same polarity of V, are almost 
the same, and their slopes are the same as 
the slopes of the direct charge transfer plots 
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Table 1. Extinction voltage 


Extinction voltage: V., (V) 


Mylar thickness 


(um) V,: Positive V,: Negative 
25 -220 
50 -263 
i fe -277 


Mylar film 
(La: 25 pm) 
Ca: 1.13 »F/m? 


(V) 


Surface potential V. 


1 4 
250 500 


Brush potential Va (V) 


Fig. 8—Relationship between surface potential and 
brush potential. 


shown in Fig. 5. As soon as V, is applied, gas 
discharge occurs and charge is deposited on the 
Maylar film. At the beginning of the charging 
process (t= 0), V, — V, rapidly decreases to the 
extinction voltage U,,. Figure 7 shows that 
Vex = 350 V for a positive brush potential, and 
Vex = —220 V for a negative brush potential. 
Since the amount of electron emission with 
a negative brush potential is greater than that 
with positive brush potential, V., for a negative 
V, is lower than that for a positive V,. From 
further experiments, V., for Mylar films 50-um 
and 75-um thick were obtained (see Table 1). 
Figure 8 shows the relationship between the 
surface potential and the brush potential for 
two different charging times as calculated from 
Equations (7) and (10) . The figure shows that 
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for a charging time of 0.8 seconds, the two 
potentials are nearly the same at all points and 
the plot is an almost straight line. For a charging 
time of 0.2 seconds the potential difference is 
much greater and the plot has three distinct 
stages. These results agree well with the experi- 
mental results. The equations can be used to 
determine the surface potential V, for a given 
V,, Ca, and charging time. 


4.4 Charging of photoreceptor® 

A bar brush (see Fig.9) was used for an 
experiment in charging the photoreceptor. 
The bar brush is a simpler charging device than 
the conductive brush roll; however, charging 


Bar brush 


Photoreceptor 


Fig. 9—Bar brush charger used to charge photoreceptor. 
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Fig. 10—Relationship between surface potential on 
selenium photoreceptor and brush potential. 
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with a bar brush tends to increase nonuniformi- 
ty of charge potential. 

Figure 10 shows the relationship between 
the brush potential V,, and the surface potential 
on the Se-Te drum JV, in the saturated state. 
The relationship is linear, and V, is nearly the 
same as V,. The nonuniformity of charge 
potential was measured in the direction of 
photoreceptor travel and is represented as +30 
(o is the standard deviation). The nonuniform- 
ity is indicated by the vertical bars in the graph. 
The surface potential V, is slightly negative at 
V, = 0 because of triboelectrification. 

Figure 11 shows the relationship between 
surface potential and charging time when the 
brush potential V, is fixed at 600 V. The 
charging time is controlled by changing the bar 
brush width. When the charging time is longer 
than 0.3 seconds, the surface potential V, satu- 
rates to reach almost V,. The nonuniformity in 
charging represented by +3a is indicated by the 
vertical bars in the graph. Nonuniformity 
decreases with increasing charging time and 
becomes less than 50 V when the charging time 
exceeds 0.3 seconds. The nonuniformity in 
corona charging is almost 50 V; therefore, the 
brush charging method is suitable for electro- 
photography. 


600 


500 


(V) 


400 


300 


Surface potential Vs 


100 


Va: 600 V 
Photoreceptor : Se-Te alloy 


0 0.2 0.4 0.6 
Charging time ¢ (s) 


Fig. 11—Surface potential as a function of charging time 
for selenium photoreceptor. 
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0 0.2 0.4 0.6 
Charging time ¢ (s) 
Fig. 12—Relationship between V, — V, and charging 
time for selenium photoreceptor, 


Brush potential V. (V) 


0 0.2 0.4 0.6 
Charging time ¢ (s) 


Fig. 13—Relationship between brush potential and 
charging time for selenium photoreceptor. 


Figure 12 shows the relationship between 
the charging time and V, — V, as calculated 
from the relationship shown in Fig. 11. Air 
breakdown occurs when the brush potential 
reaches 600 V. Therefore, Equation (10) applies 
to the charging process. The extinction voltage 
of gas discharge V,, is obtained from the 
intercept of the vertical axis of the graph and 
is 350 V. The time constant of charging, 
T=R, (C, + Cg), is obtained from the slope of 
the line and is 200 ms. 
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Figure 13 shows the relationship between 
the charging time and brush potential. This 
figure can be used to determine the required 
surface potential on the Se-Te drum for electro- 
photographic recording. The graph was drawn 
using Equation(10) with +tr=200ms_ and 
Vex = 350 V (from Fig. 12). This graph can 
also be used to determine the brush potential 
V, for given charging time and surface potential. 


5. Conclusion 

Studies on the charging process and charac- 
teristics of conductive fur brush charging led to 
the following conclusions: 

A conductive fur brush can apply both 
positive and negative charges to a dielectric 
surface. The charged potential is nearly propor- 
tional to that of the brush. 

Charge is deposited on the dielectric surface 
by three processes: triboelectric charging, 
direct charge transfer, and gas discharge. 

The direct charge transfer process can be 
described using an ohmic model. The charged 
potential is a function of the brush potential, 
charging time, dielectric capacitance, contact 
capacitance, and contact resistance. 

In the gas discharge process, the charged 
potential depends on the extinction voltage 
of gas discharge. 

Nonuniformity of charged potential on 
a photoreceptor is less than 50 V when the 
charging time is longer than 0.3 seconds. The 
brush charging method is suitable for use in 
electrophotography. 
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All-Optical Devices by Computer 
Simulation 
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This paper investigates the use of organic materials for nonlinear optical devices as a means 
to providing a breakthrough in optical nonlinearities. For electro-optic devices, an improve- 
ment in second-order nonlinearity 10-100 times as large as that of LiNbO 3 is needed. For 
all-optical devices, an improvement in the third-order nonlinearity over 10 times as large as 
that of ordinary polydiacetylene (PDA) is required. To achieve these objectives, several 
organic materials were designed based on newly proposed guidelines requiring that the 
balance between wave function overlap and separation (or difference) be optimized. 
Computer simulations show that the objectives may be attainable by controlling the wave 
function and forming quantum wells through adjusting donor and acceptor substitution sites 


in one-dimensional conjugated systems. 


1. Introduction 

Nonlinear optical devices such as electro- 
optic (EO) devices, all-optical devices, and 
optical ICs consisting of these devices are 
important in optical communication, optical 
information processing, and other optical 
systems. However, practical nonlinear optical 
devices have not yet been fabricated due to 
a lack of nonlinear optical materials with high- 
performance. 

Organic materials with conjugated systems 
have attracted interest because of their large 
optical nonlinearity)». This nonlinearity is 
expected to be superior to that of conventional 
inorganic materials such as LiNbO; (LN) and 
the multiple quantum wells (MQWs) of 
compound semiconductors. However, despite 
much work on organic materials, no notable 
improvements in optical nonlinearities have 
been made. 

The author’s motivation on this research is 
to make a break-through in the nonlinear optical 
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properties of organic materials and to clarify the 
potential of the organic materials for EO and all- 
optical devices. 


2. Nonlinear optical phenomena 

Nonlinear optical phenomena can be under- 
stood using the spring analogy in Figs. la) to 
c). In a spring, displacement x is proportional 
to external force F. With increasing F’, the spring 
stretches rapidly with a nonlinear relationship 
between x and F. Nonlinear optical phenomena 
can be similarly described by replacing x with 
polarization P and F with electric field E. 
When E is small, P is proportional to £. With 
increasing E, the contribution of the E? or E? 
term becomes dominant. The E? and E? terms 
induce second-order and third-order nonlinear 
optical effects (see Table 1). The Pockels effect, 
in which the refractive index changes linearly 
with an applied electric field, is a second-order 
nonlinear effect and is used by EO devices. 
The optical Kerr effect, in which the refractive 
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index changes linearly with incident light 
intensity, is a third-order nonlinear effect 
and is used by all-optical devices. 

Measures of second- and third-order optical 
nonlinearities for a molecule are given by 
molecular second- and third-order nonlinear 
susceptibilities 8 and y. For aggregates of many 
molecules, macroscopic second- and third-order 
optical nonlinearities can be given by second- 
and third-order nonlinear susceptibilities x? 
and x®. When the intermolecular interaction 
is weak, x” is approximately the sum of Bs of 
molecules in a unit volume”. Similarly, x? is 
the sum of ys. Electro-optic coefficient r, which 
is proportional to x, is usually used as the 
measure for Pockels effect. 

The relationship between FE and P can be 
translated into the electronic potential energy 
vs. polarization curve {see Fig. 1b) and c)}. 
When £ and P are linearly related, the potential 
energy curve is parabolic. This corresponds to the 
fact that the potential energy of a spring is 
proportional to x*. When nonlinear terms 
predominate, the potential energy curve deviates 
from the parabola. 

In general, the potential curve deviation 
becomes large in highly polarizable systems. 
Figure 2 illustrates electron-cloud polarizations 
induced by an electric field. In dielectric 
materials such as LN, electronic polarization is 
not large because electrons are localized near 
atoms, regarded as a strong spring. In semi- 
conductors, electron clouds are widely spread, 
regarded as a weak spring. Therefore, compared 
to dielectric materials, large electronic polariza- 
tion is attainable. Organic materials, especially 
long-chain molecules, have electrons delocalized 
one-dimensionally, i.e. they are regarded as 
natural quantum wires. The one-dimensional 
characteristics lead to  effcient electronic 
polarization induced by an electric field along 
the direction of the chain. This is why the 
organic materials seem promising for nonlinear 
optical devices. 


116 


Table 1. Nonlinear optical phenomena for EO and 
all-optical devices 


Effect 


Devices Measure 


Pockels EO | Bx? + 


All-optical sf x9) 


Spring 
x=aF + bF? + cF3 +> 


Aggregate 
P= HZ B+ soy Bt be oy ZO + BP os 
E3 <7 eee 


Molecule 
p=aE+ BE* + yES +: 


O) 
| Nucleus 


Electron 


a) Spring analogy of nonlinear optical phenomena 


Nonlinear 


Polarization P(x) 


Linear 


Electric field EF (F) 


b) Polarization vs. electric field 


Linear 


Nonlinear 


Potential energy 


Polarization P 


c) Potential energy vs. polarization 


Fig. 1—Nonlinear optical phenomena. 
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Fig. 2—Why organic materials are promising. 
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Fig. 3—Integrated optical waveguide deflector, and the 
calculated dependence of the deflection angle 
and resolution on electro-optic coefficient r. 
The prism electrode is 8 mm long and 5 mm 
wide, and the gap between prism and counter 
electrodes is 5 um. 


3. State-of-the-art nonlinear optical materials 
and their objective 
3.1 EO device materials 
Figures 3 and 4 show typical EO devices. 
Our goal is to have devices widely used in 
optical systems. At present, most EO devices 
are fabricated using LN. Because the optical 
nonlinearity of LN is insufficient, except for 
the optical modulator, current device character- 
istics do not come close to accomplishing the 
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Fig. 4—Matrix optical switches and modulators, and 
correlation between the number of channels 
and switching time. 


objective. 

Figure 3 gives an example of the integrated 
optical waveguide deflector proposed in this 
paper. The deflector consists of a nonlinear 
optical organic waveguide 5 wm thick inserted 
between the counter and prism electrodes 
8mm long and 5mm wide. By controlling 
the voltage applied to the prism electrodes, 
we can induce waveguide prisms with a variable 
refractive index. It is then possible to scan 
the light beam in the waveguide. The calculated 
dependence of the deflection angle and re- 
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solution on the electro-optic coefficient r is 
shown in Fig.3. When r is about 30 pm/V, 
which equals the r of LN, the deflection angle 
is only a few degrees even when 100 V is 
applied. Many deflector applications require 
a deflection angle of several tens of degrees, 
however, and resolution of more than thousands 
of points. To attain the objective, a material 
with an r of about 300 pm/V to 3 000 pm/V, 


Pendant-attached 
polymer 


Electro-optic LiNbO. 
materials a ; 


i.e. 10 times to 100 times that of LN, is required. 

Figure 4 shows the correlation between 
the number of channels and the switching 
time for matrix optical switches and modulators. 
For the constant r, the switching time increases 
as the number of channels increases due to 
limitations on the speed of the circuitry. An 
increase in the number of channels causes 
the individual constituent switch to be small, 
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Fig. 5—Second-order nonlinear optical materials for EO devices 
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Fig. 6—Measurement system for AC modulation method and the typical light modulation signal for SPCD 
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resulting in an increase in the drive voltage, 
which slows the circuit speed. The LN limit is 
roughly determined in reference to the present 
circuit performance and a wafer several inches 
in diameter. Dynamic optical interconnections 
for broadband optical communications may 
require matrix optical switches with more 
than 16 channels operating at a bit rate exceed- 
ing 10 GHz. Thus, to reach the objective for 
matrix optical switches, an r 10times to 
100 times as large as that of LN is required. 

The electro-optic coefficients of EO mate- 
rials are compared in Fig. 59). Organic materials 
can be classified into pendant-attached 
polymers, molecular crystals, and conjugated 
polymers. For molecular crystals, in the initial 
period of research, the author developed crystal- 
growth techniques for styrylpyridinium cyanine 
dye (SPCD) and 2-methyl-4-nitroaniline (MNA). 
The purpose was to study the Pockels effects 
in demonstrating the usefulness of organic 
materials as EO materials?“!”. In 1987, using 
the AC modulation method (see Fig. 6), the 
author found the SPCD thin-film crystal had 
an r of 430 pm/V, about 14 times that of LN. 
To the author’s knowledge, this is the largest 
value ever observed in an organic material. 


The results demonstrated the possible use of 
organic materials in EO devices. Since 1987, 


however, despite the promising molecular 
crystal MNBA (4'-nitrobenzylidene-3-acetamino- 
4-methoxyaniline)!” developed by Toray, no 
molecular crystals have exceeded 430 pm/V. 
This suggests that the limit of r in ordinary 
molecular crystals is about ten times r (LN). 
For the pendant-attached polymers’! , r is 
smaller than that for molecular crystals at 
present. However they can be easily formed by 
spin-coating into thin films of high optical quali- 
ty. In principle, by improving the molecular 
alignment, it may be possible to achieve an r of 
the same level as that of the molecuar crystals, 
that is, ten times r (LN). 

As mentioned above, the electro-optic 
coefficient of about 10 times r (LN), which 
is the lower limit of the proposal, seems possible 
using the molecular crystals and pendant- 
attached polymers. Device fabrication using these 
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materials is one approach. However, to further 
improve the Pockels effect, another material, 
conjugated polymers like polydiacetylene 
(PDA), should be selected. PDA is regarded 
as a natural one-dimensional system (quantum 
wire), more favorable to inducing a larger 
oscillator strength!” than two- or three-dimen- 
sional systems. It has long variable wave func- 
tions offering a greatly enhanced Pockels effect. 


3.2 All-optical device materials 

Figures 7a) and b) show the correlation 
between x” and light powers /, and J, required 
to operate of all-optical devices. 

Here, J, and J, are given by 


Mode-locked LD 


10-1 10-8 10-8 10-7 10-8 
x” (esu) 
a) Bistable optical devices and all-optical phase 
modulators 


10? 


10! D= W=3 ym 
Ny = 2.0 A=1 um 


Mode-locked LD 


I. (W) 


107! 


0-2 
. 107 10-2 10-8 1077 10-8 


x (esu) 
b) Nonlinear interface devices 
Fig. 7—Correlations between x) and the light power 
required for device operation. 
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10-3 AlGaAs/GaAs 
: MQW 


All-optical systems 


(3000/3212 )SAno? /(Lx™), 
(3000/1672? )(1 — sin@ )DWn,? / 
(x cosé D. 

S is the waveguide cross-sectional area, A the 
wavelength of incident light in a vacuum, no the 
refractive index in the dark, LZ the light path 
length, @ the incident angle of propagating 
light in the waveguide, and Dx W the area 
exposed by controlling the light. Sizes are given 
in centimeters and x) is in esu. The advantage 
of all-optical systems is high speed, e.g. a 
response time of less than 10 ps. Therefore 
the light path length in the device must be 
less than about 0.15 mm _ because the light 
propagating speed, c/ng, is 0.15 mm/10 ps 
in a medium when no equals 2. Assuming 
operations with laser diodes (LDs) or mode- 
locked LDs, bistable optical devices and optical 
phase modulators need x® exceeding 1078 esu. 
Nonlinear interface devices require x” exceed- 
ing 10~7 esu. 

The response time and x” of typical 
materials for all-optical devices are compared 
in Fig. 8. GaAlAs/GaAs MQWs_ using the 
saturated absorption of excitons in shielding 
by photocarriers exhibit a large third-order 
nonlinearity. However, they have slow response 
due to slow’ photocarrier recombination!® 
In contrast, organic materials have a fast 
response of a few picoseconds or less, but a 
small x. Semiconductor-doped glasses show 
a property in between GaAlAs/GaAs MQWs 
and organic materials. Note that x® and the 
response time are extremely sensitive to 
resonant conditions!??”. In fact, in an off- 
resonant condition, the GaAlAs/GaAs MQWs 
have a fast response comparable to organic 
materials but x? decreases to less than 
10719 esu. Clearly, no material satisfies both 
x® and the response time (see Fig. 8). 

Of organic materials, the conjugated 
polymer, PDA, has the largest x®, but its 
third-order nonlinear susceptibility is 107° to 
10-10 esu?)-?9). This is insufficient by several 
orders of magnitude for practical application 
to all-optical devices operating with LDs. Thus, 
for the optical Kerr effect, an improvement 
of at least ten times is necessary. 


Tn 
Ie 
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T 
o 


Semiconductor- 
doped glass 


x (esu) 


Organic 
materials 


107” 
l 103 
Response time (ps) 
Fig. 8—Third-order nonlinear optical materials for all- 
optical devices. 


From sections3.1 and 3.2, the author 
concluded that the work should focus on 
improving second- and third-order nonlinear 
optical properties 10 times to 100 times using 
one-dimensional conjugated systems. 


4. Qualitative guidelines for improving optical 
nonlinearities 

Figure 9 shows the microscopic origins 
of second- and third-order optical nonlineari- 
ties. As mentioned in chapter 2, the potential 
energy curve for linear systems is parabolic. 
In nonlinear systems, however, it is not. This 
induces a variety of nonlinear phenomena. 
To induce second-order nonlinearity, the 
potential curve deviation must be asymmetric 
{see Fig. 9a)}. For third-order nonlinearity, 
symmetry does not matter {see Fig. 9b)} . 
The larger the potential curve deviation from 
the parabola, the larger the induced nonlinear 
optical effects. 

The driving force for this potential curve 
deformation is explained in terms of the shape 
of the wave function {see Figs. 9a) and b)} : 
Here, consider the ground state and one excited 
state, that is, a two-level model, for simplicity. 
Introducing a perturbation of incident light 
or applied voltage to a molecule induces mixing 
the excited and ground state wave functions, 
deforming the potential curve. Therefore, 


6) ,24) 
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4) oc Boe f(P, — P,) 


Dipole moment 
difference 


Oscillator strength 


P Wave function 


Gee Excited 


Ground dig RON Oe 
~~ 
| 


Molecule 


Linear 


Nonlinear 


Potential energy 


Polarization P 


a) Second-order 


the first approach to improving optical non- 
linearity lies in promoting the mixing rate 
of ground and excited states, i.e. promoting 
the oscillator strength f between the ground 
and excited states. Here, f«<e |r |g >?, which 
can be increased by promoting the wave func- 
tion overlap between the ground and excited 
states. However, only increasing the wave 
function mixing rate is not in itself sufficient 
to increase the potential curve deformation. 
For example, when the ground and excited 
state wave functions have a similar shape, 
mixing the wave function results only in a small 
change in electron distribution. Consequently, 
there is only a small potential curve deforma- 
tion. Another approach to improving optical 
nonlinearity is promoting the wave function 
difference between the ground and excited 
states. 

The following guidelines were derived: 
Second-order nonlinearity 
1) Promote wave function overlap between 

the ground and excited states, increasing 

oscillator strength fx«<e|r|g>?. 
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Oscillator strength 


Electron distribution 
difference 


Excited 


Linear 


Nonlinear 


Potential energy 


Polarization P 
b) Third-order 
Fig. 9—Microscopic origins for second- and third-order optical nonlinearities. 


2) Promote wave function separation between 
the ground and excited states, increasing 
dipole moment difference P, — P,, where 
P. =<e|rle>and P, =<g\rig>. 
Second-order nonlinearity would then be 

proportional to the product of f and P, — Py, 

consistent with the two-level model expression 
for molecular second-order nonlinear suscepti- 
bility, 

Baf (Pe — Pg). 


Third-order nonlinearity 

1) Promote wave function overlap between 
the ground and excited states, increasing 
the oscillator strength f«<e|r|g>?. 

2) Promote wave function differences between 
the ground and excited states, increasing 
Le —Lg where Le and Ly are measures of 
wave function distribution. 

Third-order nonlinearity would then be 
proportional to the product of f and Le, — Lg. 
As shown in Fig, 10, considering one excited 
state, |e >, which has a major transition matrix 
element with the ground state | g > and neglect- 
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g 

Fig. 10—Ground g and excited e states. e has a large 
transition matrix element with the ground state. 
n is the other excited state. 


ing the energy denominator, molecular third- 
order nonlinear susceptibility is roughly ex- 
pressed as 


yxU<g|[rle><elr|[n><n\|rle> 
n 


<élFle> 
weeel|rle><e|r* |e><elrilg> 
Then, assuming, <e|r? |e>«L, —Lg, 
the difference in expected values of r? between 


ground and excited states, on the analogy of 
P. — Pz in B, we obtain 


yY«<f(Le — Lg). oe HO ie (2) 
This is consistent with our qualitative 
guidelines. 


For both second- and third-order nonlineari- 
ty, however, note that f and P, — P,, or f and 
Le — Lg are not independent, but are a tradeoff. 
It is concluded that the balance of the wave 
function overlap and wave function separation 
(or difference) must be considered simultane- 
ously in optimizing nonlinear optical effects. 


5. Simulation of optical nonlinearity in newly 
designed organic materials 
Given the guidelines in chapter4, the 
author designed several molecules with PDA 
structures, and simulated second- and _third- 
order optical nonlinearities. 
5.1 EO device materials”>?” 
Figure 11 shows three wave function shapes. 
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Fig. 11—Relationship between second-order optical 
nonlinearity and wave function separation. 


Pull 


0—____———~— 


Acceptor 


Push 


~~~ ~ im. 
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Fig. 12—Push- and pull-effects of donor (D) and 
acceptor (A). Donors push electrons and 
acceptors pull electrons when excited by light. 


Wave function separation between the excited 
and ground states increases from left to right 
and wave function overlap between the two 
states decreases, ie. Pe —P, increases and f 
decreases in a tradeoff. B is expected to have 
its maximum in an intermediate wave function 
condition as suggested in chapter 4. To obtain 
such a wave function shape, the author tried 
controlling the wave function by adjusting 
the donor and acceptor substitution sites, 
as well as the molecular chain length, using push- 
and pull-effects of donors and acceptors (see 
Fig. 12). 

Figure 13. shows the new’ molecules, 
designed by Fujitsu’s Assistance of New 
Chemistry for Original Research (ANCHOR), 
These molecules have a PDA structure and 
an acetylenic backbone. NH, is the donor 
(D) and NO, the acceptor (A), DA, DAAD, and 
DDAA are types of donor and acceptor substitu- 
tion, and the numbers following them indicate 
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Fig. 13—New molecules designed by ANCHOR. 
White balls are carbon atoms, blue balls are 
nitrogen atoms, red balls are oxygen atoms 
and green balls are hydrogen atoms. 


DA34 DAAD34 DDAA30 


UNOCCUPIED STATES UNOCCUPIED STATES _— UNOCCUPIED STATES 


OCCUPIED STATES 


© 2 
co ff hi 
On ey soe 
. Seer e® 
ay ee ee 


OCCUPIED STATES 


DDAA30 


DAAD34 


M 


Fig. 14—Contour diagrams of molecular orbitals near the Fermi surface in plane z-0.6 A (0.06 nm) and GC, Pi 
for the first excited states. G denotes occupied orbitals and E unoccupied orbitals. G1 is HOMO and 
E1 is LUMO. M denotes the occupied molecular orbitals GM (M = 1,2, ..., 6) and N unoccupied 
molecular orbitals EN (N = 1, 2, ..., 8). The height of the pyramid indicates C,, GM >EN?*. 
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the number of carbon sites (N,) in the 
conjugated chain. 

B for the Pockels effect corresponding to 
the chain direction is calculated based on 


Ward’s expression”, 


€ 
p= ——— x Pyn'PninPng X 
2 # 
an? | sen 
n#n' 


3Wn'g Wng tw? 


eee + 
(Wy 'g aa wo” Wag” — aw) 
Sting +t" 
+ EPA GP, ——_——-)._... (3) 
- Wng? — w?)? 


Here, AP, =F, —8.f =<elrlu> is the 
transition dipole moment between the ground 
and excited states. Py,;=<n\|r|n'> is that 
between the two excited states. hwy, is the 
excitation energy from the ground to the 
excited state. hw is the energy of the incident 
photons. 

Pgn, Pnn' and AP, were calculated using 
the Austin Model 1 (AM1) method®, a semi- 
empirical molecular orbital method recently 
developed by modifying the core repulsion 
function in the modified neglect of the diatomic 
overlap (MNDO) method, Molecular orbitals 
were calculated with AM1. Using these orbitals, 
the author constructed ground state wave func- 
tion W,, a product of the occupied orbitals, and 
configuration functions x;.;, products of the 
orbitals with one-electron excitation from 
occupied orbital i to unoccupied orbital /. 
Excited states W, , expressed as linear combina- 
tions of configuration functions, Equation (4), 
were determined by single-excitation configura- 
tion interaction (SCI)?”, 


v. = Fe Cnisj Xi - 


CI calculation involved eight unoccupied 
orbitals above the lowest unoccupied molecular 
orbital (LUMO) and six occupied orbitals below 
the highest occupied molecular orbital (HOMO), 
Using W, and Wy, Pon, Pun and AP, were 
calculated, 

MNA was used as the standard molecule. 
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DDAA30 


DA34 DAAD34 


f, P. — Pg (arb. units) 


. DA34 DAAD34 DDAA30 
Fig. 15-8, Pe - Pg, and f of DA34, DAAD34, and 
DDAA30. 


6 is normalized by the molecular length. Ep is 
the exicitation energy for the first excited state. 


Detuning energy: 0.2 eV 


100 


r |X 733 (LN)} 


50 


Molecular crystal 


LN 


Molecular length (A) 


Fig. 16—Molecular length dependences of r. 
ris normalized by r33 of LN. 


From the calculation, B= 13 x 107%° esu at 
1.06 um. This agrees fairly well with the 
12 x 107° esu reported by Garito et al*”?. 
Figure 14 shows the molecular orbitals 
near the Fermi surface for DA34, DAAD34, 
and DDAA30. In DA34 and DAAD34, the 
charge separation in the chain direction appears 
mainly in the HOMO and LUMO. In other 
molecular orbitals, electrons in molecules 
tend to be delocalized. In DDAA30, however, 
as in LUMO, E2 and E3 orbitals also exhibit 
considerable charge separation. In Fig, 14, 
Co for the first excited state is also shown. 
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Cn,i+j Tepresents the fraction of the configura- 
tion function involved in W,, Equation (4), 
Combining these results with shape of the 
molecular orbitals clarifies the wave function 
shape. In DA34, the Gl El component, 
corresponding to the LUMO-to-HOMO transi- 
tion, is extremely small. The wave function 
separation then becomes small corresponding 
to the wave function condition on the left side 
in Fig. 11. In DAAD34, finite Gl >EN and 
GM->El1 components appear, resulting in 
a considerable wave function separation cor- 
responding to the condition in the center in 
Fig. 11. In DDAA30, Gl EN components 
predominate and, furthermore, the EN orbital 
exhibits considerable charge separation, inducing 
the large wave function separation like on the 
right in Fig. 11. Wave function separations 
increase in the order of DA34, DAAD34, and 
DDAA30. 

Figure 15 shows the calculated molecular 
second-order nonlinear susceptibilities per 10 A 


ML 


CB 
™ 


VB 


a) Example of quantum well structure 


v, 


b) Corresponding wave function 


(1 nm) of molecular length for DA34, DAAD34, 
and DDAA30. As expected, Pe — Py increases 
and f decreases in the order of DA34, DAAD34, 
and DDAA30. 8B becomes maximum in between 
for DAAD34. This parallels the tendency of 
the qualitative guideline (see Fig.11), and 
indicates that the balance between P, — P, and 
fis important in improving B. 

In addition, it was also found that the 
molecular length affects the wave function shape 
and that adjusting the length effectively 
improves nonlinear effects. Figure 16 shows 


ML = 600 


y (arb. units) 


f (arb. units) 


L, — Ly (arb. units) 


MMi: = 
Ls. [ee LE a eaaoet z 


MZ 


c)y, fand Le - Lg 


Fig. 17—Enhancement of y by controlling wave function shapes with quantum well structure. 
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Fig. 18—Enhancement of y by controlling wave function shape with double quantum well structure. 


the chain length dependence of r in DA, DAAD, 
and DDAA molecules. r is estimated assuming 
a PDA chain density of 1.3 x 10!4 1/cm? 3%. 
r reaches a maximum in the DAAD and DA 
molecules at ~22 A (N, = 18). In PDA, an r of 
about 100 x r (LN) is expected. 

It is therefore concluded that a large second- 
order optical nonlinearity (the Pockels effect) 
of about 100 times that of LN can be obtained 
by controlling wave function separation by 
adjusting donor and acceptor sites and the 
conjugated length in one-dimensional systems. 


5.2 All-optical device materials 

To improve the third-order optical non- 
linearty (the optical Kerr effect), the shape 
of the wave function was controlled by con- 
structing quantum well structures in a one- 
dimensional system, Figures 17a) and b) give 
an example of a quantum well structure and 
the corresponding wave functions. Varying the 
well length in the conduction band (ML) and 
that in the valence band (W) adjusts the wave 
function difference between the ground and 
excited states. D is the well width. Reducing 
D changes the system from being two- to one- 
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dimensional. y per unit length for the light 
polarization with the well length direction was 
calculated from Equation (2). For simplicity, 
it was assumed that the ground state wave 
function has even parity, the excited wave 
function odd parity, and wave functions are 
square waves. The results are shown in Fig. 17c). 
With reduced D, both L, — Ly, and f increase, 
then y increases rapidly. This clearly indicates 
that the one-dimensional system is better than 
a two-dimensional system for introducing 
large optical nonlinearity (see chapter 2). That 
is, a wave function extent perpendicular to the 
direction of light polarization contributes little 
to optical nonlinearity, simply diluting the wave 
function density and reducing optical non- 
linearity. With increasing W, i.e. by increasing 
the overlap and decreasing the difference in 
wave function between the ground and excited 
states, f increases and L, — Lg decreases, with 
peaking at an intermediate region in W. 

Figure 18 shows a double quantum well 
(DQW) with a subwell at either side. The wave 
function difference is controlled by the dif- 
ference in well depth between the conduction 
and valence bands. With increasing well depth, 
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the wave function tends to localize on either 
side. The calculated values of y, Le — Lg, and 
f are shown in Fig. 18c) as a function of 
R=C/D. R is a measurement of the wave 
function extent in the valence band as defined 
in Fig. 18b), With increasing R, Le —Lg in- 
creases and f decreases. y peaks at an inter- 
mediate region in R of about 1 when the balance 
between the overlap and difference of wave 
functions is optimum, In this case, y is improved 
about five times over that for a single quantum 
well (SQW). It was also found that further 
confinement of electrons in three or more 
subwells would make it possible to enhance y 
more than ten times. 


A DA DA DAD A 


KD A DD KR OD A DD 


5.3 Quantum well formation in PDA 

To apply the materials in sections 5.1 and 
5.2 to practical devices, it is necessary to 
fabricate the materials as a thin film. This 
involves constructing quantum well structures 
in one-dimensional PDA chains to make PDA 
thin-film crystals. 

Recently, the author found from molecular 
orbital calculation that the energy gap of PDA 
can be reduced when donors and acceptors 
are substituted in the DAAD type?” (see 
Fig. 19). Using this effect, the author expected 
to insert many DAAD molecules into the PDA 
chain, constructing a one-dimensional MQW 
(see Fig. 20)®) 34), This structure enables DAAD 
molecules to be aligned perfectly, which is 
favorable to attaining a large Pockels effect. 

Using energy gap narrowing, it should be 
possible to form quantum wells in PDA by 
modulating the energy gap with selective sub- 
stitutions of donors and acceptors into the 
PDA chain™):34), The DQW with two wells 
14A (1.4nm) long designed by ANCHOR 
is shown in Fig. 21 together with an SQW. 
Quantum wells are formed in PDA with 


VB 7 i 
Fig. 19—Band gap control in PDA by donor and acceptor 38 carbon sites. Areas with donor and acceptor 
substitution. substitution are wells, and areas with hydrogen 
DAADI18 DAADI18 DAADI18 
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1 I 1 | | | 
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Fig. 20—Multiple quantum well formation in PDA. Wells can be regarded as DAAD18 molecules inserted 


in a PDA chain. 
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Fig. 21—SQW and DQW structures designed by ANCHOR and their molecular orbitals. 
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substitution are barriers. The energy gap in the 
well region is about 1.6 eV, about one half 
of that in the barrier region. Molecular orbitals 
show that, in the SQW, electrons are spread 
throughout the molecular chain. In the DQW, 
electrons tend to be confined to both sides 
of the molecular chain. Here, note that, 
although electrons for unoccupied orbitals are 
localized on both sides in all El, E2, and E3 
orbitals, for occupied orbitals, electrons are in 
the barrier region in the G3 orbital. This suggests 
that the wave function tends to delocalize 
throughout the chain more in the valence 
band than in the conduction band, reproducing 
the condition in Fig. 18. 

y was calculated by AM1 molecular orbital 
calculation using the following formula: 


4 
e 
Y= 2 PgnPan'Pnin Pag X 
3 
4h 
n 
] 
(Wng — W)(Wn'g — 2w)Wng —w) 


The expression neglects antiresonant terms. 
This is because calculations were done at a 
deturning energy of 0.2eV from the first 
excited state, which induces a large resonant 
enhancement in y. As expected, y in the DQW 
is more than twice that in the SQW. This 
indicates that it may be possible to improve 
third-order optical nonlinearity by adjusting 
the well structures. 


6. Projections 

Research on artificial materials, like PDA or 
other conjugated systems with the MQWs above, 
has far-reaching consequences. If a way is devel- 
oped to stack a monomer on a monomer with 
direction control, the donor- and acceptor- 
substitution locations and the conjugated length 
could be controlled. Organic molecular beam 
epitaxy”, organic chemical vapor deposition 
under electric field*®’, and molecular layer 
deposition (MLD)*”, in which films grow 
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through self-terminated monomolecular growth 
steps, are promising techniques for making artifi- 
cial materials. It is also worthwhile to make 
actual EO devices using available pendant- 
attached polymers or molecular crystals. 

In addition to controlling the wave function 
as described in this paper, another approach, i.e. 
sharpening the absorption band?®, is also 
important for improving optical nonlinearities 
by resonant enhancement. The uniformity 
of the conjugated length in materials is 
important in sharpening the absorption band, 
as is reducing the exciton-phonon coupling 
strength. Bound excitons or exciton confine- 
ments in quantum wells in one-dimensional 
systems may be a way to sharpen absorption 
bands. 

Research on _ organic nonlinear optics 
involves two fields, physics and chemistry. 
A deficiency in either will halve progress. The 
author believes that, to succeed in developing 
the full potential of organic nonlinear materials 
and devices, close cooperation of physicists 
and chemists is essnetial. 


7. Conclusion 

The author investigated the use of organic 
materials in EO and all-optical devices, and 
looked for a way to make a breakthrough to 
a new understanding of nonlinear optical prop- 
erties. First, the author clarified the goal to 
fabricate practical devices for optical systems. 
It was found that second-order optical non- 
linearity must be improved by 10 times to 
100 times over that of LN and that third-order 
nonlinearity must be improved more than 
10 times over that of ordinary PDA. The authors 
derived qualitative guidelines for improving 
optical nonlinearities, i.e. that the balance 
between the overlap and separation (or differ- 
ence) of wave functions must be optimized. 
Based on these guidelines, the author designed 
séveral organic materials using ANCHOR??? 
From computer simulations, it was found that 
organic materials have the potential to reach 
the goal by controlling the wave function and 
forming wells through adjusting the donor 
and acceptor substitution sites. 
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The importance of computer-integrated manufacture (CIM) in the design and manufacturing 
of printed circuit boards (PC boards) is widely acknowledged. It is difficult to increase 
the efficiency of schematic design, pattern layout design, or manufacturing using CAD/CAM 
separately. This paper discusses the construction of a PC board CIM system for use as a key 
system in a company. It describes the background and scope of CIM, and explains the 
concept of CIM construction, showing examples. It also introduces the automated drawing 
input system for PC board design and the supporting system for mounting the parts. This 
system is advanced technology related to CIM construction. 


1. Introduction 

PC board CAD/CAM is a prerequisite for 
the construction of PC board CIM. This chapter 
gives a background on the use of PC board 
CAD/CAM., 

Competition is keen in the field of PC 
board design and manufacturing. Readily 
identifiable products are required and the 
time required for the entire process from 
product planning to shipment must be reduced 
to market products sooner. 

High-density mounting for PC_ boards 
using surface mounting technology has made 
considerable progress. The number of parts 
per PC board has increased, and the patterns 
have become more complex. At present, up to 
1600 parts are used for an analog PC board. 
In the near future, pattern layouts using about 
3000 parts will appear. The pattern is about 
0.1mm wide. This increase in the number 
of parts complicates the management of parts 
lists related to the calculation of weight and 
cost. It is difficult to manage the mounting 
of the parts, and manual processing is almost 
impossible. 

Separate systematization including 
schematic design and artwork has reached the 
limit for increasing efficiency and reducing 
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the time required for the entire process. The 
system is used for the entire process from 
schematic design to drilling, assembly, and 
testing the PC board. It is necessary to construct 
a PC board CIM system for the design and 
manufacture of PC boards. The system must 
integrate CAD/CAM systems that cover systems 
from the schematic design system to the 
manufacturing system (artwork, mounting, etc.). 
Also, it has to link the three departments: 
Design department using PC board CAD, 
production department, and manufacturing 
department. 

This paper discusses the basic ideas of 
constructing a PC board CIM system, and 
then explains the system of analog PC boards 
of the ICAD/PCBN® as an example. 


Note: ICAD/PCB (integrated Computer-Aided Design 
and manufacturing system/Printed Circuit 
Board) is the system for PC board design and 
manufacture in ICAD”, which is Fujitsu’s 
CAD/CAM system. Fujitsu developed this 
system based on the technology accumulated 
through designing PC boards for computers 
and communication equipment. Fujitsu has 
expanded the system in cooperation with its 
customers. 
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design 
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Fig. 1—Structure of PC board CIM system. 


2. Structure of the PC board CIM system 
Figure 1 shows the structure of the PC 
board CIM system. The core components 
of the system are the schematic design system 
and the pattern layout design system. These 
systems are closely connected with the informa- 
tion of nets and parts. The CAM systems are 
artwork, drilling, and mounting, etc. The PC 
board CIM must be linked to the other CAD/ 
CAM systems, such as the parts list system, 
simulation, and mechanical design and mounting. 
The ICAD/PCB almost covers these require- 
ments, and facilitates the construction of CIM. 


3. The basic idea of CIM construction 

The following four points are important 
for constructing a PC board CIM system. 
1) Supporting a flexible design flow 

It is essential not to fix the design flow 
of processes such as schematic design, pattern 
layout design, and PC board manufacture. 
The system must support a flexible design flow. 
2) Gradually applying the system 

The system should be implemented in stages, 
based on the flexible design flow, without 
changing the current system rapidly. 
3) Integration using a system link 

The system should integrate the related 
CAD/CAM systems, such as the parts list system 
and mechanical CAD system. 
4) Applying CAM to the assembly process 

The system should be able to support 
not only outputting NC data for artwork and 
drilling, but also the assembly and _ test 


processe 5”), 
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3.1 Supporting a flexible design flow 

PC boards are designed and manufactured 
in several stages, e.g., primary trial manufacture, 
secondary trial manufacture, trial manufacture 
for mass production, and mass _ production. 
This cycle is repeated in a short time. Schematic 
design often changes during pattern layout 
design. Pattern layout design may be started 
before schematic design is completed. 

The relationship between in-house design 
and subcontracted design also varies. The 
information passed to subcontractors may be 
rough schematic sketches or the schematic 
diagram and the net list generated by CAD. 
The subcontractors may deliver data in the 
gerber format after pattern layout or data of 
the PC board (patterns, parts, and net informa- 
tion), The design may change in the company 
while the subcontractors are working on the 
design. 

As indicated, schematic design and pattern 
layout design are usually parallel processes. 
In-house design is also performed in parallel 
with subcontracted design. An_ integrated 
system for PC board CIM must accommodate 
a flexible design flow and be able to collate 
design information. That is, the system must 
be able to collate schematic diagrams with 
pattern layouts. 

The patterns of design and _ verification 
generated by the ICAD/PCB are shown in 
Fig. 2. 

1) The information of nets and parts is ex- 
tracted after the schematic diagram is 
completed, and the pattern layout is started 
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using the extracted information. 
2) The pattern layout is designed first. When 
the schematic diagram is completed, the 


Schematic \,* Nets/ * parts 

eS eo Pattern layout 

[ Rough drawing / h drawin hea Schematic 
[soa rawina * Nets 
| 
L———-»{ Pattern layout 

Schematic * Nets : 
Pattern layout 


Added part name 
Reference to nets 


(Parts list pS 


‘en (Pattern layout layout 
ee Sgremati’) 


diagram 
Fig. 2—Flexible design flow and some collations. 


Collate =—(Pattern layout) =—(Pattern layout) 


(Other aie systems) 


(Pattern layout) (Pattern layout) 


Verification 


039 388 a obo 


Drawing of pattern layout 


information of nets is incorporated into 
the pattern layout. 

3) The pattern layout is designed using only 
the nets of the schematic diagram. Then, 
the part names are added while designing 
the pattern layout, and the related nets 
are incorporated when the parts are assigned. 

4) The information of parts is extracted from 
the parts list system, and the pattern layout 
is started using the extracted information. 

5) The pattern layout is designed without 
preparing a schematic diagram. 

6) The schematic diagram is collated with 
the pattern layout. 

7) The schematic diagram designed in the 
company is collated with the pattern layout 
designed by a subcontractor using another 
CAD system. 

8) The schematic diagram, pattern layout, 
and parts list are collated (a triple collation). 


3.2 Gradually applying the system 

The following method is generally used to 
design the pattern layout for PC boards. 

The pattern layout is drawn by subcon- 
tractors by hand. The drawing is then digitized 
with the CAD system because it is difficult 


c) Skeleton drawing 


Fig. 3—Three types of drawings prepared by hand. 
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to design analog pattern layout automatically. 
CIM construction will not succeed if the current 
design work is changed rapidly. It is important 
to introduce the system gradually, utilizing 
the current design work. The ICAD/PCB enables 
the separate use of individual subsystems, 
such as the schematic design system, pattern 
layout design system, and manufacturing sys- 
tem. 

This paper introduces the system for draw- 
ing input. This is an automated drawing input 
system for PC boards, for gradually applying 
the system considering the actual design condi- 
tions. With such a system, drawings of pattern 
layout are made by hand using a pencil and 
plotting paper. The drawings prepared by 
hand are classified into three types (see Fig. 3). 
1) Board outline 
2) Drawing containing part name and the 

cordinates (parts placement) 

3) Drawing of only the paths of the pattern 

(skeleton drawing) 

In the digitized system, these drawings 
are designed in the same way as drawings done 
by hand. This enables design to continue with- 
out placing a burden on the new system, These 
drawings are input by a scanner, and are 
converted to CAD data after segment and 
character recognition. For a skeleton drawing, 
the program automatically performs width 
adjustment and teardropping. In this way, 
the skeleton drawing is converted into the 
pattern drawing. 

The ICAD/PCB pattern layout editor must 
correct segments of the drawing done by hand 
that are indistinct and cannot be recognized 


Days required 


Digitizing system 


Automated drawing 
input system 


: 1 le- 
(Pattern generation) peed pie 
(Pattern & parts 
generation) 


Fig. 4—Days required for drawing input (example). 
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(about 2 percent in -normal operation), After 
the pattern layout data is generated, ICAD/PCB 
is used to complete the design by editing and 
modifying the data. The processing time, 
compared with that for the digitizing system, 
can be greatly reduced (see Fig. 4). For example, 
it is possible to reduce the time required by 
accepting drawings prepared by hand from 
subcontractors for use with the automated 
drawing input system. 


3.3 Integration using a system link 
It is important when constructing the PC 

board CIM system to integrate the related 

CAD/CAM systems. For this reason, the system 

should open the interface to read and write 

to the ICAD/PCB database. Use the database 
as the nucleus of the PC board CIM system or 
system linked with other systems. 

3.3.1 Parts list system 

The operations linking three systems and 

parts library have increased (see Fig. 5). 

For example, this link enables the following 
processing. 

1) Automatic parts selection considering the 
cost and delivery data based on _ the 
constants and rating determined by the 
schematic design 

2) Using the schematic diagram database 
to automatically generate a parts list 

3) Pattern layout design can also be started 
based on the parts information for each 
pattern layout. 

4) The following can be performed realtime 
during schematic design: 

i) Estimating the weight of the PC board 
ii) Calculating the cost 
iii) Estimating the rate of insertion 


Specifications 


a 
design 


—— 
Net-list 


Library 
Pattern layout 
design 


Fig. 5—System linked by parts list system. 
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Such information is useful for designers. 

3.3.2 Other CAD systems 

Many companies use a combination of 
two or more PC board CAD systems. Even 
if a company uses a single CAD system, related 
companies often use different CAD systems. 
The following are examples of the link required 
for the PC board CIM system. 
1) Acquiring information of artwork based 

on gerber data 
2) Collating the schematic diagrams designed 

in-house with the pattern layouts designed 

by subcontractors, based on a net file. 
3) Acquiring parts information of another 

CAD system based on the parts list. 

3.3.3, Mechanical CAD system 

As high-density mounting has progressed, 
the outlines of PC boards have become more 
complex. For this reason, the outlines of PC 
boards are usually designed by mechanical 
CAD, and the data is incorporated into a PC 
board CAD. The following method is under 
examination: To return the data to mechanical 
CAD after PC board design, and use it for me- 
chanical design. 


3.4 Applying CAM to the assembly process 
Constructing a PC board CIM system ex- 
pands the range of CAM, Formerly, the main 
process performed by CAM was outputting 
NC data for artwork and drilling. Now, to 
reduce the time required, CAM should be 
applied to the assembly process. In the assembly 
process, experts manually generate parts mount- 
ing data, checking the machine characteristics, 
CAD data, and parts information. This is done 
to attain the following: 
1) To prevent the destruction of parts 
2) To reduce the time taken for mounting 
parts 
3) To improve the ratio of machine operation 
In the future, the number of parts will 
increase because of high-density mounting. 
The manual generation of data will become 
difficult. Support by the CAM system will 
be indispensable. The supporting system for 
mounting the parts with the ICAD/PCB satisfies 
these requirements, and enables the automated 
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Table 1. Performance of parts mounting system 


(example) 
ae ae 
PC board time (hour) time (s) 
Expert 16 147 
a0 ICAD/PCB 3 : 148 
Expert 23 190 
oo ICAD/PCB 3 193 


generation of data in the assembly process. 
Table 1 shows an example that compares 
the work by an expert with the performance 
of the parts mounting system. This table 
indicates that the results of this system are 
equivalent to those of the expert. Furthermore, 
the time taken to generate data using this system 
is much less than the time taken by the expert. 


4. Advanced technology related to CIM 

construction 

The essential points of CIM construction 
have been explained above. This chapter 
concerns the technology of the automated 
drawing input system and the supporting system 
for mounting the parts, which have been 
mentioned as examples. 


4.1 Automated drawing input system for PC 
boards 
4.1.1 System outline 
To reduce the time required for initial 
input to CAD, an automated drawing input 
system that combines the ICAD/PCB and 
the FADCS (Fujitsu Advanced Drawing Capture 
System)” was developed for analog pattern 
layout. Priority was given to the following: 
1) Enabling accurate recognition 
2) Preventing CAD data from increasing 
Figure 6 shows the configuration of this 
system. First, the skeleton drawing and parts 
arrangement drawing based on the drawing 
notation is input using a scanner. Then, it is 
passed as image data to the recognition unit. 
This unit performs vectorization and character 
recognition. The vectorization process auto- 
matically selects a mode to extract the center 
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Fig. 6—Automated drawing input system for PC boards. 
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Fig. 7—Flow of pattern generation, 


line or contour of the pattern. The character 
recognition process identifies the characters 
and symbols in the configuration drawing. 
The vector data and the character recognition 
results are sent to the computer. 

The program for pattern generation 
processes the data processed by the recognition 
unit. The data is stored as pattern layout data 
in the ICAD/PCB. The ICAD/PCB_ checks 
the connection and clearance of the pattern 
layout data automatically. It confirms and 
corrects the data interactively, and designs 
the analog pattern layout. 

4.1.2 The pattern generation process 

To apply the automated drawing input 
system to PC board design, automatic pattern 
generation performs the processing shown in 
Fig. 7. Each step is outlined after the figure. 

1) Correcting coordinates 
Corrects dislocation or inclination on the 
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pattern drawing that occurs during input by 
the scanner. Adjusts the coordinates between 
the classified drawings (skeleton drawing, 
parts placement, and board outline). 
2) Deleting unnecessary data 

Before recognition, deletes the noise caused 
by dirt on the drawing or description errors, 
such as an irregular segment on a curve. 
3) Recognizing geometric figures 

To reduce the input pattern layout data, 
identifies straight lines, circles, and arcs based 
on the vector data obtained by the FADCS. 
Corrects their connections to identify them 
as smooth data. 
4) Processing data 

Reads the shape of the part indicated by 
the part number in the configuration drawing 
from the parts library of the ICAD/PCB. 
Generates the part data for the pattern layout 
based on the obtained information. At this 
time, generates a land at each pin position of 
the part, which is determined by the position 
and direction of the part. For a pattern path, 
rounds the endpoints and points of inflection 
at the center of the lands and generates through- 
holes at the open endpoints. In this way, makes 
adjustments between the drawings. Finally, 
adjusts the width of the identified pattern, 
planes the surface, and smoothes and teardrops 
the pattern. 
5) Storing the data into CAD 

Stores the data of the recognized and 
processed pattern and parts in the ICAD/PCB 
database. 


4.2 Supporting system for mounting the parts 
4.2.1 System outline 
The parts mounting system aims at reducing 

the time taken for parts mounting. The follow- 

ing algorithms for parts mounting with the 

ICAD/PCB are used for this purpose: 

1) Minimizing the movement of the head table 
(basic algorithm) 

2) Minimizing the movement of the cassette 

3) Minimizing the interference between the 
head and parts 

4) Minimizing the head rotation 

5) Minimizing the pitch change 
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Fig. 8—Flow of data generation for mounting. 


These algorithms can be combined, depend- 
ing on the inserter type. With the algorithm 
to minimize the interference, the head and parts 
are checked as three-diemensional shapes. 
To improve productivity, this system equalizes 
the operating time of each inserter in the 
assembly process. 

Figure 8 shows the configuration of this 
system, which performs the following: 

1) Parts classification 

Classifies all parts mounted on a PC board 
based on their shapes and specifications. 
2) Interference check 

Checks for interference between the parts 
and inserter head during parts mounting. 
3) Mounting priority processing 

Determines the mounting priority of the 
parts based on the parts placement. 
4) Parts distribution 

Distributes the parts so that the assembly 
time is equalized when the same type of inserter 
machine is used. 

5) Mounting sequence processing 

Determines the parts mounting sequence 

based on the specific algorithm for reducing 
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Fig. 9—Model of data generation for mounting. 


the time taken for mounting the parts. 
6) Verification and editing 

Verifies the mounting time and the im- 
balance in the operating time between the 
inserter machines, based on the mounting 
sequence and the operation characteristics of 
the inserter machines. 

4.2.2 Procedure for generating data for 

mounting 

This section explains the procedure for 
generating data for mounting, using the model 
shown in Fig. 9 as an example. 

The model consists of two inserter machines 
of the same type. Five types of parts and eleven 
parts are specified. The mounting tact of each 
machine is 0.8 s and 0.4 s per part. 

First, an interference check is made based 
on the following information: 

1) Placement of the parts mounted on the PC 
board 

2) Head shape when the head catches on a part 
registered in the machine library 

3) Shape of parts in the parts library 

When parts Al and B1 will interfere, the 
mounting priority processing generates informa- 
tion in the part priority table whereby part B1 
has priority over part Al. 

The distribution of parts types is based 
on the quantity of parts and the placement 
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information. For parts distribution processing, 
the parts priority and the performance ratio 
of the mounting tact are considered. Since 
the tact ratio of the two inserter machines 
is 2:1, the parts distribution ratio is set to 1:2. 
The distribution results are generated in the 
sequence table for parts mounting. After the 
parts have been assigned to the _ inserter 
machines, the mounting sequence is determined 
for each sequence table, considering the parts 
placement information, parts priority, and 
operating characteristics of the machines. 

Finally, the mounting time for each machine 
is verified to equalize the operating time. 

This model uses two inserter machines of 
the same type. Any number of machines can be 
used. 


5. Conclusion 

Technological innovation is keen in the field 
of PC board design and manufacturing. It is 
necessary to reduce the time required for 
these processes. For this reason, the demand 
for the PC board CIM system will increase. 

This paper discussed the basic idea of 
the PC board CIM system, giving examples of 
the ICAD/PCB. It is important to use the 
CAD/CAM system as the nucleus in this system. 
It is also important to establish the database 
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management technology used for centrally 
managing the technical information, such as 
parts lists, parts attributes, drawings, and 
technical documents. It is also necessary to 
establish the network technology that enables 
information to be transferred smoothly between 
related departments. 

Fujitsu will acquire the related technology 
and enhance the ICAD/PCB as the nucleus 
of the PC board CIM system. 
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High-Speed Structural Analysis Program: 
POPLAS*/ FEM5 on Supercomputer 


@ Yasuhiro Komori @ Katsuhiko Akahori (Manuscript received August 30, 1990) 


This paper describes the high-speed processing achieved by the development of the structural 
analysis program POPLAS©®/FEM5 (called FEMS) on the multiple-pipeline architecture of 
the FUJITSU VP-series supercomputer. 

The CPU time has been reduced by using a method of solving simultaneous linear equations 
that takes advantage of performance capabilities of a supercomputer. This method is com- 
bined with a matrix column-row exchange method. Concentrated vector tuning is also used. 
The |/O time has been reduced by developing original I/O processing. As a result, FEM5 has 


achieved superlative high-level, high-speed processing in the structural analysis field. 


1. Introduction 

With the increase of more advanced, more 
complicated products in recent years, the range 
of processes subject to simulations in product 
development has extended. The scale of these 
processes has become larger, and the need for 
supercomputers!),2) has grown. In the structural 
analysis field, there is a great demand to reduce 
the time necessary for product development. 
For example, company A, a Fujitsu VP user, 
asked Fujitsu to improve the performance of 
structural analysis program FEMS based on 
finite element method, with a technique that 
utilizes the performance of the FUJITSU 
VP-series, so that FEMS can solve large-scale 
problems. 

Fujitsu has developed and provided FEMS, 
which operates on computers ranging from super- 
computers to workstations. Two hundred and 
fifty copies of FEMS have been distributed. 

For our development of high-speed process- 
ing by FEM5 on a supercomputer, the target 
performance for FEM5 was specified based on 
the requirements of company A as follows: 
FEMS5 can solve a problem with about 15 000 
degrees of freedom (matrix size: 15 000 x 15 000) 
requiring CPU time and_ elapsed time 
that are 0.8 to 1.2 times as long as those taken 
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for the general-purpose structural analysis pro- 
gram ‘‘a’’, the famous fastest general-purpose 


program. 


2. Problems 

In converting a commercial structural analy- 
sis program for use on a supercomputer, the 
scalar version of the source program, the version 
which is used on general-purpose computers, is 
copied as is onto the supercomputer. Then this 
source code is vectorized using an automatic 
vectorizing FORTRAN compiler. Alternatively, 
more often the source program is subjected to 
vector tuning by hand and tested. 

An algorithm such as that for solving simul- 
taneous linear equations, exchanging the 
columns and rows of a matrix, etc., that fully 
takes advantage of the performance of a super- 
computer has not been applied to a commercial 
structural program. Also, no programming 
technique that takes full advantage of the per- 
formance capabilities has been examined on the 
application program side. The CPU time and I/O 
time have not been reduced effectively. 


2.1 Reduction of CPU time 
Effectively reducing the CPU time on a 
supercomputer requires the following: 
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1) Employing a method of solving simultaneous 
linear equations that takes advantage of the 
performance of the supercomputer CPU 

2) Combining the solution method with a ma- 
trix column-row exchange method to maxi- 
mize the performance 

3) Improving the vectorization ratio through 
concentrated vector tuning efforts 


2.2 Reduction of I/O time 
To reduce I/O time effectively on a super- 

computer, a new method for an application pro- 

gram must be developed for the following rea- 
sons: 

1) There is a method that depends on the spe- 
cific hardware, whereas none of the methods 
operate on the standard hardware configura- 
tion. 

2) There is no general method to reduce the 
I/O time. For each application, the best 
method must be developed with the charac- 
teristics of the application taken into ac- 
count. 


3. Developed techniques 
The following new techniques have been 
developed. They reduce the CPU time and I/O 
time to make full use of the performance of a 
supercomputer, instead of simply using the sca- 
lar version of the source program of FEM5 on 
the supercomputer. 
1) Block skyline method 
2) GPS method (Norman E. Gibbs, William 
G. Poole. Jr, and Paul K. Stockmeyer meth- 
od)? combined with the block skyline 
method 
3) Concentrated vector tuning 
4) Design of parallel I/O functions and combi- 
nation with the block skyline method 


3.1 Choice of the block skyline method 

There are some methods for solving linear 
simultaneous equations, such as the band matrix 
method, the block skyline method, the active 
column method, and the CG method (the conju- 
gate gradient method). The block skyline meth- 
od and the CG method are known for taking 
advantage of the CPU performance of a super- 
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computer. 

The algorithm for the block skyline method 
solves linear simultaneous equations while re- 
ducing the related columns for each column of 
the matrix. When the block skyline method is 
written in FORTRAN, a multi nested Do-loop 
source program is obtained. When the FOR- 
TRAN program is compiled with the automatic 
vectorizing FORTRAN compiler for a super- 
computer, an object module with an extremely 
high vectorization ratio is obtained. 

The CG method also takes advantage of the 
CPU performance of a supercomputer. A proto- 
type was produced and evaluated with a real 
problem (honeycomb-structure car parts). Al- 
though the prototype sometimes resulted in 
outstanding CPU performance, the results 
showed that unstable convergence occurred for 
some data. Therefore, the CG method was not 
adopted for a commercial general-purpose pro- 
gram. 

This is why the block skyline method was 
adopted. The block skyline method is outlined 
below: 

1) Only the upper right triangular portion of a 
matrix is handled, because matrices handled 
in linear structure analysis are symmetric. 

2) The area up to the nonzero element (called 
the skyline) that is farthest from the diago- 


Skyline 


Symmetric 


Fig. 1—Skyline. 


FUJITSU Sci. Tech, J., 27, 1, (April 1991) 


Y. Komori, and K. Akahori: High-Speed Structural Analysis Program: POPLAS®/FEMS on Supercomputer 


nal element is handled (see Fig. 1). 3.2 GPS method combined with block skyline 
3) Forward reduction and back-substitution are method 

performed for each column of a matrix (see Only nonzero elements of the matrix proc- 

Fig. 2). 


4) I/O operations are performed between an 
external storage unit and main storage for 
each group of columns (called a block) (see 


Fig. 3). 
Column 7 
Block Block Block Block! 
(e.g. foward reduction for the 7th column) External 
storage 
Fig. 2— Operation for each column in a matrix. unit 


Fig. 3—I/O processing for each block. 


Symmetric 


Symmetric 


@: Nonzero element 
a) Before matrix column-row exchange b) After matrix column-row exchange 


Fig. 4—Matrix column-row exchange. 
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GPS minimizes the area between the skyline 
and diagonal elements 


Fig. 5—Feature of the GPS method. 


essed in structural analysis can be collected near 

the diagonal element in a column-row exchange 

(see Fig. 4). Some column-row exchange meth- 

ods are available. To reduce the CPU time and 

I/O time effectively, a method fit for the simul- 

taneous linear equations solver is needed. 

The algorithm of the GPS method is charac- 
terized by its ability to minimize the area bet- 
ween the skyline and the diagonal element. This 
characteristic enables a matrix to be generated 
so that the number of operations in the block 
skyline method and the quantity of I/O data 
between main storage and the external storage 
unit can be minimized (see Fig. 5). Therefore, 
the GPS method was adopted as the method 
that could maximize the performance of the 
block skyline method. 

GPS processing is outlined below: 

1) The apexes of both ends in the longitudinal 
direction of element division are selected. 

2) The level (distance from the apex) of each 
node is determined. 

3) The level structure that minimizes the num- 
ber of nodes of the same level is selected by 
repeating steps 1) and 2). 

4) Nodes are numbered in the order of the 
level. 
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ai: Block skyline method part 


Fig. 6—Distribution of CPU time. 


3.3 Concentrated vector tuning 

Analysis of the CPU time using a real prob- 
lem shows that, for a problem whose matrix 
size is about 15000 x 15 000, about three per- 
cent (block skyline method part: 4 ksteps) of 
the entire source code takes up about 95 percent 
of the total CPU time in a scalar computer 
FUJITSU M-380 (see Fig. 6). 

Therefore, the CPU time can be greatly re- 
duced by concentrating vector tuning efforts on 
the block skyline method part of the source 
code (For example, expanding DO-loops, ex- 
changing indices of nested DO-loops). 

In the final results, the ratio of the total 
CPU time of M-380 to that of FUJITSU VP200 
was 100 to 12 (see Fig. 6). This ratio exactly 
represents the effect of vectorization because 
the VP200 has the same scalar performance as 
the M-380. 


3.4 Parallel I/O function combined with block 

skyline method 

In the block skyline method, the data (two 
blocks) necessary for forward reduction and 
back substitution which is performed for each 
column of a matrix is stored in a work area in 
main storage. If the data is in more than two 
blocks, it is necessary to transfer them to an ex- 
ternal storage unit to perform direct I/O proc- 
essing of each block, and to transfer the blocks 
necessary for operation to main storage (see 
Fig. 7). For a problem whose matrix size is 
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Block skyline method 


Forward 


reduction —— f 


Block A 


Back 


substitution ' 


The direct I/O 


> Block B 
processing of each 
block is necessary 
Two blocks exist 


in main storage t 


Main storage 


External 
storage unit 


<5 


Asynchronous and 
simultaneous I/O 
processing 


Fig. 7—Block skyline method and parallel I/O method. 


Unit of I/O 
: processing 
Unit of parallel 1/O 


by BDAM 
Record p 
length 


Block 


BDAM : basic direct access method 


Fig. 8—Block division. 


about 15 000 x 15 000, the area between the sky- 
line and the diagonal element is 40 Mbytes and 
much I/O data is generated. 

The block skyline method is characterized 
by its I/O operation which is performed in block 
units. This characteristic was used to develop an 
original parallel I/O function (see Fig. 7). This 
function divides a block into subblocks and per- 
forms subblock I/O operations for multiple ex- 
ternal storage units simultaneously and asyn- 
chronously. 

Figure 8 shows how a block is divided. 
Figure 9 shows the processing performed by the 
parallel I/O routine. 

Because I/O operations are performed simul- 
taneously for multiple external storage units, 
the number of subblocks that constitute a block 
and number of paths for the channels that are to 
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i=\1,n 


READ of Xi 
READ of Yi 
READ of Zi 


CHECK 
CHECK 
CHECK 


Fig. 9—Parallel I/O routine. 


operate concurrently must be increased. Inde- 
pendent channel paths must be assigned to the 
storage units. Otherwise channels will compete 
and the I/O time will not be effectively reduced. 

In FEM5, the number of subblocks varies 
from | to 10 so that FEM5 can flexibly adapt 
itself to the hardware configuration of a user 
system. In actual user operations, the number is 
usually 4. 


4. Results 

The FEMS program developed using the 
above methods has been tested with seven sam- 
ples of car structural analysis data (matrix size: 
5202 x 5202 to 21826 x 21826) on the 
FUJITSU VPI100 system of company A. The 
results show that the CPU time is 18 minutes and 
8 seconds and that the elapsed time is 2 hours 
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ee 0 0.5 1.0 


Program “aq” + L n a ae: es es ey ee 


CPU FEM5 18min 8s 


time : 
Program “q” 25 min 37s 


Elapsed} FEM5 2h 26min | 
time Program “a” 2h 45 min 


Computer: VP100 
A total of seven samples of data 
(matrix size: 5202 x 5202 to 21826 x 21826) 


Fig. 10—Survey data of CPU time and elapsed time. 


26 minutes (total of seven samples of data). On 
the other hand, program “‘a’’ required the CPU 
time of 25 minutes 37 seconds, and the elapsed 
time of 2 hours 45 minutes for the same samples 
of data. Thus, assuming the CPU and elapsed 
time for program “‘a’’ to be 1, respectively, the 
results show a CPU time of 0.7, and elapsed time 
of 0.9. These values satisfy the requirements (see 
Fig. 10). As of August 1990, the CPU time and 
the elapsed time are less than those obtained by 
any other general-purpose finite element analysis 
program in the world. 


5. Conclusion 
The development of FEMS has reduced the 
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CPU time and I/O time resulting in high-speed 
processing by FEMS on a supercomputer. To 
reduce the I/O time, a new parallel I/O function 
has also been developed. The application of the 
parallel I/O function to other programs has re- 
vealed that it can be used as a general-purpose 
function to reduce the I/O time in a batch type 
technical calculation program. The same func- 
tion has been available as a high-speed I/O func- 
tion in the library of the Fujitsu FORTRAN77 
EX V11 compiler for general users since Sep- 
tember 1990. 

Fujitsu plans to continue this technical de- 
velopment according to the performance of the 
supercomputer hardware functions (such as ex- 
panded high-speed data space and virtual I/O 
features), and to achieve faster processing. 
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(886-2)-713-5396 


(82-417)-581-0700 
(86-591)-560022 
(1-408)-432-1318 
(61-2)-411-8603 
(61-2)-975-2899 
(1-714)-630-7660 


(1-602)-921-5999 
(1-416)-6 73-8677 


(60-7)-481-771 
(1-408)-428-0640 
(1-619)-457-9968 


(49-89)-323-78100 
(55-11)-885-9132 


(34-1)-581-8300 
(44-8 1)-573-2643 
(44-8 1)-756-0286 
(44-8 1)-573-2643 
(33-1)-4-399-0700 
(852)-721724 
(1-203)-796-5665 or 
(1-203)-796-5723 


(31-20)-427675 


(39-2)-657-2257 
(82-2)-739-3294 


(65)-265-6275 


(1-408)-432-9044 
(353-1)-520539 


(39-2)-824-6 189 
(44)-628-781484 
(60-3)-511-1227 
(852-3)-721-6555 
(49-6103)-690-122 
(1-919)-790-8376 
(64-4)-733-429 


(46-8)-28-0345 
(63-2)-817-7576 


(65)-225-5075 
(1-408)-492-1982 
(1-619)-259-2603 


(66-2)-529-2581 


(55-27)-225-0954 
(61-3)-670-6183 


(1-303)-682-6401 
(886-2)-717-4644 


(The information here as of April 5, 1991) 


FUJITSU LIMITED 


6-1, Marunouchi 1-chome, Chiyoda-ku, Tokyo 100, Japan 
Phone: National (03) 3216-3211 International (Int' Prefix) 81-3-3216-321% Telex: J22833 Cable: “FUJITSULIMITED TOKYO” 


